112 Commits

Author SHA1 Message Date
Fabian Ohler
1bce697475 add visibility of PAPERLESS_PRE_CONSUME_SCRIPT 2022-01-26 10:48:05 +01:00
Daniel Albers
3ebe6d5aef
Make ignores configurable
Adds config file setting PAPERLESS_CONSUMER_IGNORE_PATTERNS.
2021-08-18 22:23:18 +02:00
jonaswinkler
cb10617979 enable deskewing and rotation by default 2021-02-21 23:40:26 +01:00
jonaswinkler
ce121a261d completely reworked the OCRmyPDF parser. 2021-02-21 00:16:57 +01:00
Fabian Koller
4fa1779ef3
Only provide valid CORS allowed hosts in example
Addresses that do not carry a scheme cause errors on database migration.


SystemCheckError: System check identified some issues:

ERRORS:
?: (corsheaders.E013) Origin 'example.com' in CORS_ALLOWED_ORIGINS is missing scheme or netloc
    HINT: Add a scheme (e.g. https://) or netloc (e.g. example.com).
2021-01-14 21:14:46 +01:00
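The check quoted above rejects origins that lack a scheme or netloc. A minimal sketch of that kind of validation follows. It assumes a plain `urllib.parse` test and is not the actual django-cors-headers code; the helper name `has_scheme_and_netloc` is hypothetical.

```python
from urllib.parse import urlsplit


def has_scheme_and_netloc(origin: str) -> bool:
    """Return True if the origin carries both a scheme and a netloc.

    Hypothetical helper for illustration; it only mirrors the condition
    described in the corsheaders.E013 message.
    """
    parts = urlsplit(origin)
    return bool(parts.scheme) and bool(parts.netloc)


# A bare host name has no scheme, so it would be flagged.
print(has_scheme_and_netloc("example.com"))
# Adding a scheme, as the hint suggests, makes the origin acceptable.
print(has_scheme_and_netloc("https://example.com"))
```

Under this assumption, every entry in the example config's allowed hosts should be written with its scheme, e.g. `https://example.com` rather than `example.com`.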
jonaswinkler
6c7281ae88 add sslmode configuration option fixes #298 2021-01-10 01:35:56 +01:00
Jonas Winkler
b17d9f850e
Merge pull request #260 from shamoon/feature/remote-user
Feature: authentication via HTTP_REMOTE_USER
2021-01-06 14:27:14 +01:00
Jonas Winkler
be94a8e49a
Merge pull request #251 from jayme-github/ignore-date
Add option to ignore certain dates in parse_date
2021-01-05 00:19:13 +01:00
jayme-github
2aa2086dfb Add missing config options to example file 2021-01-03 14:35:28 +01:00
Michael Shamoon
f0a1aed029 Merge remote-tracking branch 'upstream/dev' into feature/remote-user 2021-01-03 00:38:10 -08:00
Michael Shamoon
7b56ad9dad Allow authentication via HTTP_REMOTE_USER 2021-01-03 00:37:19 -08:00
jonaswinkler
4cef4adc7e config file 2021-01-02 15:30:52 +01:00
jonaswinkler
e5c10fcd93 Merge branch 'master' into dev 2020-12-31 02:15:01 +01:00
Fabian Koller
9f18d0ad45
Fix ENV var name for user args in example config
The actual string used when looking up the user arguments ends with an S: `PAPERLESS_OCR_USER_ARGS`
2020-12-29 23:52:27 +01:00
jonaswinkler
f964dd5935 added configuration option for the font #197 #207 2020-12-29 12:26:41 +01:00
jonaswinkler
b452816a29 fixes #122 2020-12-11 17:49:32 +01:00
jonaswinkler
fca98b411e reorganised settings documentation and added OCR_USER_ARGS 2020-11-29 12:38:32 +01:00
Jonas Winkler
a03315102a added image DPI detection to the tesseract parser. 2020-11-25 19:37:48 +01:00
Jonas Winkler
d3c13f6c93 removed unused settings. 2020-11-25 19:30:11 +01:00
Jonas Winkler
2d559d330d reworked PDF parser that uses OCRmyPDF and produces archive files. 2020-11-25 14:50:43 +01:00
Jonas Winkler
f4013b1343 added setting PAPERLESS_AUTO_LOGIN_USERNAME 2020-11-23 22:50:02 +01:00
Jonas Winkler
fec9e54049 new setting: PAPERLESS_OCR_PAGES 2020-11-22 12:54:08 +01:00
Jonas Winkler
d1f9f456bb moved the documentation of the config to the actual docs.
2020-11-19 13:18:57 +01:00
Jonas Winkler
d8e6804650 removed all quotes from the config: they are not needed and are confusing for the docker-compose env files, where they are actually not allowed. 2020-11-18 22:42:56 +01:00
Jonas Winkler
c487e5f017 a new setting that allows you to skip thumbnail optimization. 2020-11-18 22:42:05 +01:00
Jonas Winkler
fef6dd38f9 Merge branch 'dev' into mail_rework 2020-11-17 00:23:10 +01:00
Jonas Winkler
31c4167535 added option for polling 2020-11-16 18:52:13 +01:00
Jonas Winkler
f6a926c9b1 added config options to conf example 2020-11-16 18:37:12 +01:00
Jonas Winkler
0b1637da62 first implementation of the mail rework 2020-11-15 23:56:22 +01:00
Jonas Winkler
83f82f3caf added a setting: delete duplicate documents 2020-11-10 01:47:58 +01:00
Jonas Winkler
9d22d9c9b1 added a task scheduler for recurring tasks 2020-11-09 20:29:10 +01:00
Jonas Winkler
44cd8565b1 removed no-login option since it wasn't working with the new django version anyway. 2020-11-09 15:28:45 +01:00
Jonas Winkler
9067a4f288 added the filename handling back into the code 2020-11-08 13:00:45 +01:00
Jonas Winkler
dc59e0f257 updated settings: docker image runs without ENV variables 2020-11-03 12:23:24 +01:00
Jonas Winkler
9ce926df7f updated configuration 2020-11-02 21:59:36 +01:00
Jonas Winkler
9f55fb668d silenced unpaper, optipng for cleaner output
moved parser settings to settings
removed forgiving ocr (now default) since tesseract is plenty accurate even without defining the correct language.
2020-11-01 23:23:42 +01:00
Jonas Winkler
5c4849796b fixed folders, compatible with previous paperless version 2020-10-29 14:30:15 +01:00
Jonas Winkler
322832c9b3 config cleanup 2020-10-29 00:34:37 +01:00
Jonas Winkler
c26962f17f changed a few things 2020-10-27 17:08:18 +01:00
Jonas Winkler
052c1680f3 added
- document index
- api access for thumbnails/downloads
- more api filters

updated
- pipfile

removed
- filename handling
- legacy thumb/download access
- obsolete admin gui settings (per page items, FY, inline view)
2020-10-25 23:03:02 +01:00
Jonas Winkler
b0143a280e removed reminders, changed a bunch of stuff 2020-10-20 00:35:39 +02:00
Jonas Winkler
421dab786d Merge branch 'master' into dev 2020-10-16 15:02:57 +02:00
Wolf-Bastian Poettner
07bb9aa723 Allow filename to use tags[KEY] and tags[INDEX] 2020-03-28 21:06:17 +00:00
Wolf-Bastian Poettner
d8fda64cd0 Tags in filenames can now be found either by key (in case - or _ is
used as the delimiter) or by index. Added some more tests.
2020-02-01 20:58:44 +00:00
Wolf-Bastian Poettner
6a671ebd22 Unified PAPERLESS_DIRECTORY_FORMAT and PAPERLESS_FILENAME_FORMAT into
one configuration setting
2020-02-01 20:09:36 +00:00
Wolf-Bastian Pöttner
e70d0c53c8
Update paperless.conf.example
Co-Authored-By: Pit <pitkley@googlemail.com>
2020-02-01 14:22:02 +01:00
Wolf-Bastian Poettner
6813805712 Allow configuring directory and filename formats for documents stored in paperless
The default configuration is as before (incrementing numbers), but additional fields can be added at will
2019-12-27 14:25:38 +00:00
Michael Gmelin
f0320fb72d Allow configuring transformations to be applied to the filename before
parsing. The motivation was that files produced by a Brother scanner
wouldn't match paperless' expectations. At most one transformation
is applied (first matching). It won't affect the filename on disk.

This is generic enough so that it is useful for various purposes.
In my case it allows me to use the different hardware buttons on
the scanner to use different profiles, feeding one instance of
paperless with documents of multiple entities and tagging them
accordingly.

Example:

PAPERLESS_FILENAME_PARSE_TRANSFORMS=[{"pattern":"^([a-z]+)_(\\d{8})_(\\d{6})_([0-9]+)\\.", "repl":"\\2\\3Z - \\4 - \\1."},{"pattern":"^([a-z]+)_([0-9]+)\\.", "repl":" - \\2 - \\1."}]
2019-08-31 15:01:25 +02:00
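The "first matching transformation wins" behaviour described in the commit above can be sketched as follows. This is an illustration under stated assumptions, not the code from the commit: the function names `load_transforms` and `transform_filename` and the sample filenames are hypothetical, and the setting value is taken verbatim from the example in the commit message.

```python
import json
import re

# Value of PAPERLESS_FILENAME_PARSE_TRANSFORMS as given in the commit message.
RAW = r'''[{"pattern":"^([a-z]+)_(\\d{8})_(\\d{6})_([0-9]+)\\.", "repl":"\\2\\3Z - \\4 - \\1."},{"pattern":"^([a-z]+)_([0-9]+)\\.", "repl":" - \\2 - \\1."}]'''


def load_transforms(raw):
    """Parse the JSON list into (compiled pattern, replacement) pairs."""
    return [(re.compile(t["pattern"]), t["repl"]) for t in json.loads(raw)]


def transform_filename(name, transforms):
    """Apply at most one transformation: the first whose pattern matches.

    Only the returned string changes; the file on disk is not renamed.
    """
    for pattern, repl in transforms:
        new_name, count = pattern.subn(repl, name)
        if count:
            return new_name
    return name


transforms = load_transforms(RAW)
# Hypothetical scanner output: profile, date, time, counter.
print(transform_filename("invoices_20190831_150125_0001.pdf", transforms))
# Hypothetical scanner output without a timestamp: falls through to rule two.
print(transform_filename("invoices_0001.pdf", transforms))
```

A filename that matches no pattern is returned unchanged, so the setting can be left empty without affecting parsing.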
Jonas Winkler
32f3876590 Merge branch 'master' into dev 2019-05-21 13:06:16 +02:00
JensPfeifle
ea282c22ba Add GS_BINARY to settings to avoid hardcoded call of "gs" 2019-03-03 20:31:52 +01:00