97 Commits

Author SHA1 Message Date
jonaswinkler
e02838b198 fixes #122 2020-12-11 17:49:32 +01:00
jonaswinkler
388f6cfbe6 reorganised settings documentation and added OCR_USER_ARGS 2020-11-29 12:38:32 +01:00
Jonas Winkler
3b655c95d9 added image DPI detection to the tesseract parser. 2020-11-25 19:37:48 +01:00
Jonas Winkler
8b1aa9cf68 removed unused settings. 2020-11-25 19:30:11 +01:00
Jonas Winkler
15935ab61f reworked PDF parser that uses OCRmyPDF and produces archive files. 2020-11-25 14:50:43 +01:00
Jonas Winkler
b275297a05 added setting PAPERLESS_AUTO_LOGIN_USERNAME 2020-11-23 22:50:02 +01:00
Jonas Winkler
ae198f0767 new setting: PAPERLESS_OCR_PAGES 2020-11-22 12:54:08 +01:00
Jonas Winkler
99c5ecd99a moved the documentation of the config
to the actual docs.
2020-11-19 13:18:57 +01:00
Jonas Winkler
8f95fb1c8b removed all quotes from the config: they are not needed and are confusing for the docker-compose env files, where they are actually not allowed. 2020-11-18 22:42:56 +01:00
Jonas Winkler
4230a0a474 a new setting that allows you to skip thumbnail optimization. 2020-11-18 22:42:05 +01:00
Jonas Winkler
24bb8c71c9 Merge branch 'dev' into mail_rework 2020-11-17 00:23:10 +01:00
Jonas Winkler
2119eb4c15 added option for polling 2020-11-16 18:52:13 +01:00
Jonas Winkler
b6d2943e89 added config options to conf example 2020-11-16 18:37:12 +01:00
Jonas Winkler
d99b4623f8 first implementation of the mail rework 2020-11-15 23:56:22 +01:00
Jonas Winkler
3048342de7 added a setting: delete duplicate documents 2020-11-10 01:47:58 +01:00
Jonas Winkler
d3e7c8ff4e added a task scheduler for recurring tasks 2020-11-09 20:29:10 +01:00
Jonas Winkler
0f498f1016 removed no-login option since it wasn't working with the new django version anyway. 2020-11-09 15:28:45 +01:00
Jonas Winkler
548941b1bb added the filename handling back into the code 2020-11-08 13:00:45 +01:00
Jonas Winkler
64ee9a4941 updated settings: docker image runs without ENV variables 2020-11-03 12:23:24 +01:00
Jonas Winkler
749d152347 updated configuration 2020-11-02 21:59:36 +01:00
Jonas Winkler
6adc870a20 silenced unpaper, optipng for cleaner output
moved parser settings to settings
removed forgiving ocr (now default) since tesseract is plenty accurate even without defining the correct language.
2020-11-01 23:23:42 +01:00
Jonas Winkler
46d1fdc108 fixed folders, compatible with previous paperless version 2020-10-29 14:30:15 +01:00
Jonas Winkler
3fee159306 config cleanup 2020-10-29 00:34:37 +01:00
Jonas Winkler
ba210d3b5e changed a few things 2020-10-27 17:08:18 +01:00
Jonas Winkler
93d963ed4e added
- document index
- api access for thumbnails/downloads
- more api filters

updated
- pipfile

removed
- filename handling
- legacy thumb/download access
- obsolete admin gui settings (per page items, FY, inline view)
2020-10-25 23:03:02 +01:00
Jonas Winkler
67d9040580 removed reminders, changed a bunch of stuff 2020-10-20 00:35:39 +02:00
Jonas Winkler
b71049ad16 Merge branch 'master' into dev 2020-10-16 15:02:57 +02:00
Wolf-Bastian Poettner
85a0560d77 Allow filename to use tags[KEY] and tags[INDEX] 2020-03-28 21:06:17 +00:00
Wolf-Bastian Poettner
7391f5c1a2 Tags in filenames can now either be found by key (in case - or _ are
used as delimeter) or by index. Added some more tests.
2020-02-01 20:58:44 +00:00
Wolf-Bastian Poettner
a5518a307c Unified PAPERLESS_DIRECTORY_FORMAT and PAPERLESS_FILENAME_FORMAT into
one configuration setting
2020-02-01 20:09:36 +00:00
Wolf-Bastian Pöttner
2d98951d88 Update paperless.conf.example
Co-Authored-By: Pit <pitkley@googlemail.com>
2020-02-01 14:22:02 +01:00
Wolf-Bastian Poettner
d1a54d6576 Allows to configure directory and filename formats for documents stored in paperless
Default configuration is as before (incrementing numbers), but additional fields can be added at will
2019-12-27 14:25:38 +00:00
Michael Gmelin
14b81c613c Allow configuring transformations to be applied to the filename before
parsing. The motivation was that files produced by a Brother scanner
wouldn't match paperless' expectations. At most one transformation
is applied (first matching). It won't affect the filename on disk.

This is generic enough so that it is useful for various purposes.
In my case it allows me to use the different hardware buttons on
the scanner to use different profiles, feeding one instance of
paperless with documents of multiple entities and tagging them
accordingly.

Example:

PAPERLESS_FILENAME_PARSE_TRANSFORMS=[{"pattern":"^([a-z]+)_(\\d{8})_(\\d{6})_([0-9]+)\\.", "repl":"\\2\\3Z - \\4 - \\1."},{"pattern":"^([a-z]+)_([0-9]+)\\.", "repl":" - \\2 - \\1."}]
2019-08-31 15:01:25 +02:00
Jonas Winkler
5dda05d712 Merge branch 'master' into dev 2019-05-21 13:06:16 +02:00
JensPfeifle
4c64ea0404 Add GS_BINARY to settings to avoid harcoded call of "gs" 2019-03-03 20:31:52 +01:00
Daniel Quinn
8682570653 Align example conf with real-world defaults 2019-01-27 13:05:56 +00:00
Jonas Winkler
acc144de62 Merge branch 'master' into dev 2019-01-21 15:07:49 +01:00
Daniel Quinn
b150fd431b Merge pull request #468 from ddddavidmartin/document_ocr_always_setting
Reference PAPERLESS_OCR_ALWAYS in example config file.
2018-12-31 14:30:58 +00:00
David Martin
1076320a06 Reference PAPERLESS_OCR_ALWAYS in example config file.
This setting was introduced when support for retrieving the text layer
from documents was added. Having it in the example config makes it more
clear that it exists.
2018-12-16 18:11:39 +11:00
Jonas Winkler
8f0d53c54a Merge remote-tracking branch 'upstream/master' 2018-12-11 12:06:15 +01:00
Daniel Quinn
0d59844567 Conform everything to the coding standards
https://paperless.readthedocs.io/en/latest/contributing.html#additional-style-guides
2018-12-01 17:09:12 +00:00
Joshua Taillon
b0326b5a19 Merge branch 'master' of github.com:danielquinn/paperless into ENH_filename_date_parsing 2018-11-15 23:17:59 -05:00
Joshua Taillon
a2422cc529 Add option for parsing of date from filename (and associated tests) 2018-11-15 20:32:15 -05:00
David Martin
8cf32d2a5a Add PAPERLESS_FORGIVING_OCR option to example config.
It helps having it in the example config as that makes it more clear
that it exists.
2018-10-08 19:38:38 +11:00
Daniel Quinn
bc898c1992 Use optipng to optimise document thumbnails 2018-10-07 14:56:38 +01:00
Jonas Winkler
7887892e4e Added a bunch of new features:
- Debug mode is now configurable in the configuration file. This way, we don't have to edit versioned files to disable it on production systems.
- Recent correspondents filter (enable in configuration file)
- Document actions: Edit tags and correspondents on multiple documents at once
- Replaced month list filter with date drilldown
- Sortable document count columns on Tag and Correspondent admin
- Last correspondence column on Correspondent admin
- Save and edit next functionality for document editing
2018-09-13 15:19:25 +02:00
Jonas Winkler
5390672762 Fixed a few minor issues. 2018-09-12 16:25:23 +02:00
Jonas Winkler
a48cc6c627 Merge branch 'master' into dev 2018-09-12 11:47:35 +02:00
Jonas Winkler
9c51d7d2d1 fixed settings 2018-09-11 17:30:46 +02:00
Daniel Quinn
0521506239 Make the example file contain the default value 2018-09-09 21:16:53 +01:00