103 Commits

Author SHA1 Message Date
Jonas Winkler
be94a8e49a
Merge pull request #251 from jayme-github/ignore-date
Add option to ignore certain dates in parse_date
2021-01-05 00:19:13 +01:00
jayme-github
2aa2086dfb Add missing config options to example file 2021-01-03 14:35:28 +01:00
jonaswinkler
4cef4adc7e config file 2021-01-02 15:30:52 +01:00
jonaswinkler
e5c10fcd93 Merge branch 'master' into dev 2020-12-31 02:15:01 +01:00
Fabian Koller
9f18d0ad45
Fix ENV var name for user args in example config
The actual string used when looking up the user arguments ends with an S: `PAPERLESS_OCR_USER_ARGS`
2020-12-29 23:52:27 +01:00
jonaswinkler
f964dd5935 added configuration option for the font #197 #207 2020-12-29 12:26:41 +01:00
jonaswinkler
b452816a29 fixes #122 2020-12-11 17:49:32 +01:00
jonaswinkler
fca98b411e reorganised settings documentation and added OCR_USER_ARGS 2020-11-29 12:38:32 +01:00
Jonas Winkler
a03315102a added image DPI detection to the tesseract parser. 2020-11-25 19:37:48 +01:00
Jonas Winkler
d3c13f6c93 removed unused settings. 2020-11-25 19:30:11 +01:00
Jonas Winkler
2d559d330d reworked PDF parser that uses OCRmyPDF and produces archive files. 2020-11-25 14:50:43 +01:00
Jonas Winkler
f4013b1343 added setting PAPERLESS_AUTO_LOGIN_USERNAME 2020-11-23 22:50:02 +01:00
Jonas Winkler
fec9e54049 new setting: PAPERLESS_OCR_PAGES 2020-11-22 12:54:08 +01:00
Jonas Winkler
d1f9f456bb moved the documentation of the config
to the actual docs.
2020-11-19 13:18:57 +01:00
Jonas Winkler
d8e6804650 removed all quotes from the config: they are not needed and are confusing for the docker-compose env files, where they are actually not allowed. 2020-11-18 22:42:56 +01:00
Jonas Winkler
c487e5f017 a new setting that allows you to skip thumbnail optimization. 2020-11-18 22:42:05 +01:00
Jonas Winkler
fef6dd38f9 Merge branch 'dev' into mail_rework 2020-11-17 00:23:10 +01:00
Jonas Winkler
31c4167535 added option for polling 2020-11-16 18:52:13 +01:00
Jonas Winkler
f6a926c9b1 added config options to conf example 2020-11-16 18:37:12 +01:00
Jonas Winkler
0b1637da62 first implementation of the mail rework 2020-11-15 23:56:22 +01:00
Jonas Winkler
83f82f3caf added a setting: delete duplicate documents 2020-11-10 01:47:58 +01:00
Jonas Winkler
9d22d9c9b1 added a task scheduler for recurring tasks 2020-11-09 20:29:10 +01:00
Jonas Winkler
44cd8565b1 removed no-login option since it wasn't working with the new django version anyway. 2020-11-09 15:28:45 +01:00
Jonas Winkler
9067a4f288 added the filename handling back into the code 2020-11-08 13:00:45 +01:00
Jonas Winkler
dc59e0f257 updated settings: docker image runs without ENV variables 2020-11-03 12:23:24 +01:00
Jonas Winkler
9ce926df7f updated configuration 2020-11-02 21:59:36 +01:00
Jonas Winkler
9f55fb668d silenced unpaper, optipng for cleaner output
moved parser settings to settings
removed forgiving ocr (now default) since tesseract is plenty accurate even without defining the correct language.
2020-11-01 23:23:42 +01:00
Jonas Winkler
5c4849796b fixed folders, compatible with previous paperless version 2020-10-29 14:30:15 +01:00
Jonas Winkler
322832c9b3 config cleanup 2020-10-29 00:34:37 +01:00
Jonas Winkler
c26962f17f changed a few things 2020-10-27 17:08:18 +01:00
Jonas Winkler
052c1680f3 added
- document index
- api access for thumbnails/downloads
- more api filters

updated
- pipfile

removed
- filename handling
- legacy thumb/download access
- obsolete admin gui settings (per page items, FY, inline view)
2020-10-25 23:03:02 +01:00
Jonas Winkler
b0143a280e removed reminders, changed a bunch of stuff 2020-10-20 00:35:39 +02:00
Jonas Winkler
421dab786d Merge branch 'master' into dev 2020-10-16 15:02:57 +02:00
Wolf-Bastian Poettner
07bb9aa723 Allow filename to use tags[KEY] and tags[INDEX] 2020-03-28 21:06:17 +00:00
Wolf-Bastian Poettner
d8fda64cd0 Tags in filenames can now either be found by key (in case - or _ are
used as delimeter) or by index. Added some more tests.
2020-02-01 20:58:44 +00:00
Wolf-Bastian Poettner
6a671ebd22 Unified PAPERLESS_DIRECTORY_FORMAT and PAPERLESS_FILENAME_FORMAT into
one configuration setting
2020-02-01 20:09:36 +00:00
Wolf-Bastian Pöttner
e70d0c53c8
Update paperless.conf.example
Co-Authored-By: Pit <pitkley@googlemail.com>
2020-02-01 14:22:02 +01:00
Wolf-Bastian Poettner
6813805712 Allows to configure directory and filename formats for documents stored in paperless
Default configuration is as before (incrementing numbers), but additional fields can be added at will
2019-12-27 14:25:38 +00:00
Michael Gmelin
f0320fb72d Allow configuring transformations to be applied to the filename before
parsing. The motivation was that files produced by a Brother scanner
wouldn't match paperless' expectations. At most one transformation
is applied (first matching). It won't affect the filename on disk.

This is generic enough so that it is useful for various purposes.
In my case it allows me to use the different hardware buttons on
the scanner to use different profiles, feeding one instance of
paperless with documents of multiple entities and tagging them
accordingly.

Example:

PAPERLESS_FILENAME_PARSE_TRANSFORMS=[{"pattern":"^([a-z]+)_(\\d{8})_(\\d{6})_([0-9]+)\\.", "repl":"\\2\\3Z - \\4 - \\1."},{"pattern":"^([a-z]+)_([0-9]+)\\.", "repl":" - \\2 - \\1."}]
2019-08-31 15:01:25 +02:00
Jonas Winkler
32f3876590 Merge branch 'master' into dev 2019-05-21 13:06:16 +02:00
JensPfeifle
ea282c22ba Add GS_BINARY to settings to avoid harcoded call of "gs" 2019-03-03 20:31:52 +01:00
Daniel Quinn
3c78105fd7 Align example conf with real-world defaults 2019-01-27 13:05:56 +00:00
Jonas Winkler
4bd018df07 Merge branch 'master' into dev 2019-01-21 15:07:49 +01:00
Daniel Quinn
cea8332038
Merge pull request #468 from ddddavidmartin/document_ocr_always_setting
Reference PAPERLESS_OCR_ALWAYS in example config file.
2018-12-31 14:30:58 +00:00
David Martin
4d13521f36 Reference PAPERLESS_OCR_ALWAYS in example config file.
This setting was introduced when support for retrieving the text layer
from documents was added. Having it in the example config makes it more
clear that it exists.
2018-12-16 18:11:39 +11:00
Jonas Winkler
766109ae4e Merge remote-tracking branch 'upstream/master' 2018-12-11 12:06:15 +01:00
Daniel Quinn
d544f269e0 Conform everything to the coding standards
https://paperless.readthedocs.io/en/latest/contributing.html#additional-style-guides
2018-12-01 17:09:12 +00:00
Joshua Taillon
730daa3d6d Merge branch 'master' of github.com:danielquinn/paperless into ENH_filename_date_parsing 2018-11-15 23:17:59 -05:00
Joshua Taillon
e1d8744c66 Add option for parsing of date from filename (and associated tests) 2018-11-15 20:32:15 -05:00
David Martin
818780a191 Add PAPERLESS_FORGIVING_OCR option to example config.
It helps having it in the example config as that makes it more clear
that it exists.
2018-10-08 19:38:38 +11:00