110 Commits

Author SHA1 Message Date
jonaswinkler
ca397c5678 enable deskewing and rotation by default 2021-02-21 23:40:26 +01:00
jonaswinkler
e3dd1863a9 completely reworked the OCRmyPDF parser. 2021-02-21 00:16:57 +01:00
Fabian Koller
3d0a1e4185 Only provide legal cors allowed hosts in example
Addresses that do not carry a scheme cause errors on database migration.


SystemCheckError: System check identified some issues:

ERRORS:
?: (corsheaders.E013) Origin 'example.com' in CORS_ALLOWED_ORIGINS is missing scheme or netloc
    HINT: Add a scheme (e.g. https://) or netloc (e.g. example.com).
2021-01-14 21:14:46 +01:00
jonaswinkler
c710db7aab add sslmode configuration option fixes #298 2021-01-10 01:35:56 +01:00
Jonas Winkler
499baf6694 Merge pull request #260 from shamoon/feature/remote-user
Feature: authentication via HTTP_REMOTE_USER
2021-01-06 14:27:14 +01:00
Jonas Winkler
22f45ac619 Merge pull request #251 from jayme-github/ignore-date
Add option to ignore certain dates in parse_date
2021-01-05 00:19:13 +01:00
jayme-github
34421e2cef Add missing config options to example file 2021-01-03 14:35:28 +01:00
Michael Shamoon
8caa61bd9d Merge remote-tracking branch 'upstream/dev' into feature/remote-user 2021-01-03 00:38:10 -08:00
Michael Shamoon
009de5184f Allow authentication via HTTP_REMOTE_USER 2021-01-03 00:37:19 -08:00
jonaswinkler
9d64fb6669 config file 2021-01-02 15:30:52 +01:00
jonaswinkler
9d5db0dcfd Merge branch 'master' into dev 2020-12-31 02:15:01 +01:00
Fabian Koller
0119c247f6 Fix ENV var name for user args in example config
The actual string used when looking up the user arguments ends with an S: `PAPERLESS_OCR_USER_ARGS`
2020-12-29 23:52:27 +01:00
jonaswinkler
fe73f42495 added configuration option for the font #197 #207 2020-12-29 12:26:41 +01:00
jonaswinkler
e02838b198 fixes #122 2020-12-11 17:49:32 +01:00
jonaswinkler
388f6cfbe6 reorganised settings documentation and added OCR_USER_ARGS 2020-11-29 12:38:32 +01:00
Jonas Winkler
3b655c95d9 added image DPI detection to the tesseract parser. 2020-11-25 19:37:48 +01:00
Jonas Winkler
8b1aa9cf68 removed unused settings. 2020-11-25 19:30:11 +01:00
Jonas Winkler
15935ab61f reworked PDF parser that uses OCRmyPDF and produces archive files. 2020-11-25 14:50:43 +01:00
Jonas Winkler
b275297a05 added setting PAPERLESS_AUTO_LOGIN_USERNAME 2020-11-23 22:50:02 +01:00
Jonas Winkler
ae198f0767 new setting: PAPERLESS_OCR_PAGES 2020-11-22 12:54:08 +01:00
Jonas Winkler
99c5ecd99a moved the documentation of the config
to the actual docs.
2020-11-19 13:18:57 +01:00
Jonas Winkler
8f95fb1c8b removed all quotes from the config: they are not needed and are confusing for the docker-compose env files, where they are actually not allowed. 2020-11-18 22:42:56 +01:00
Jonas Winkler
4230a0a474 a new setting that allows you to skip thumbnail optimization. 2020-11-18 22:42:05 +01:00
Jonas Winkler
24bb8c71c9 Merge branch 'dev' into mail_rework 2020-11-17 00:23:10 +01:00
Jonas Winkler
2119eb4c15 added option for polling 2020-11-16 18:52:13 +01:00
Jonas Winkler
b6d2943e89 added config options to conf example 2020-11-16 18:37:12 +01:00
Jonas Winkler
d99b4623f8 first implementation of the mail rework 2020-11-15 23:56:22 +01:00
Jonas Winkler
3048342de7 added a setting: delete duplicate documents 2020-11-10 01:47:58 +01:00
Jonas Winkler
d3e7c8ff4e added a task scheduler for recurring tasks 2020-11-09 20:29:10 +01:00
Jonas Winkler
0f498f1016 removed no-login option since it wasn't working with the new django version anyway. 2020-11-09 15:28:45 +01:00
Jonas Winkler
548941b1bb added the filename handling back into the code 2020-11-08 13:00:45 +01:00
Jonas Winkler
64ee9a4941 updated settings: docker image runs without ENV variables 2020-11-03 12:23:24 +01:00
Jonas Winkler
749d152347 updated configuration 2020-11-02 21:59:36 +01:00
Jonas Winkler
6adc870a20 silenced unpaper, optipng for cleaner output
moved parser settings to settings
removed forgiving ocr (now default) since tesseract is plenty accurate even without defining the correct language.
2020-11-01 23:23:42 +01:00
Jonas Winkler
46d1fdc108 fixed folders, compatible with previous paperless version 2020-10-29 14:30:15 +01:00
Jonas Winkler
3fee159306 config cleanup 2020-10-29 00:34:37 +01:00
Jonas Winkler
ba210d3b5e changed a few things 2020-10-27 17:08:18 +01:00
Jonas Winkler
93d963ed4e added
- document index
- api access for thumbnails/downloads
- more api filters

updated
- pipfile

removed
- filename handling
- legacy thumb/download access
- obsolete admin gui settings (per page items, FY, inline view)
2020-10-25 23:03:02 +01:00
Jonas Winkler
67d9040580 removed reminders, changed a bunch of stuff 2020-10-20 00:35:39 +02:00
Jonas Winkler
b71049ad16 Merge branch 'master' into dev 2020-10-16 15:02:57 +02:00
Wolf-Bastian Poettner
85a0560d77 Allow filename to use tags[KEY] and tags[INDEX] 2020-03-28 21:06:17 +00:00
Wolf-Bastian Poettner
7391f5c1a2 Tags in filenames can now either be found by key (in case - or _ are
used as delimeter) or by index. Added some more tests.
2020-02-01 20:58:44 +00:00
Wolf-Bastian Poettner
a5518a307c Unified PAPERLESS_DIRECTORY_FORMAT and PAPERLESS_FILENAME_FORMAT into
one configuration setting
2020-02-01 20:09:36 +00:00
Wolf-Bastian Pöttner
2d98951d88 Update paperless.conf.example
Co-Authored-By: Pit <pitkley@googlemail.com>
2020-02-01 14:22:02 +01:00
Wolf-Bastian Poettner
d1a54d6576 Allows to configure directory and filename formats for documents stored in paperless
Default configuration is as before (incrementing numbers), but additional fields can be added at will
2019-12-27 14:25:38 +00:00
Michael Gmelin
14b81c613c Allow configuring transformations to be applied to the filename before
parsing. The motivation was that files produced by a Brother scanner
wouldn't match paperless' expectations. At most one transformation
is applied (first matching). It won't affect the filename on disk.

This is generic enough so that it is useful for various purposes.
In my case it allows me to use the different hardware buttons on
the scanner to use different profiles, feeding one instance of
paperless with documents of multiple entities and tagging them
accordingly.

Example:

PAPERLESS_FILENAME_PARSE_TRANSFORMS=[{"pattern":"^([a-z]+)_(\\d{8})_(\\d{6})_([0-9]+)\\.", "repl":"\\2\\3Z - \\4 - \\1."},{"pattern":"^([a-z]+)_([0-9]+)\\.", "repl":" - \\2 - \\1."}]
2019-08-31 15:01:25 +02:00
Jonas Winkler
5dda05d712 Merge branch 'master' into dev 2019-05-21 13:06:16 +02:00
JensPfeifle
4c64ea0404 Add GS_BINARY to settings to avoid harcoded call of "gs" 2019-03-03 20:31:52 +01:00
Daniel Quinn
8682570653 Align example conf with real-world defaults 2019-01-27 13:05:56 +00:00
Jonas Winkler
acc144de62 Merge branch 'master' into dev 2019-01-21 15:07:49 +01:00