240 Commits

Author SHA1 Message Date
jonaswinkler
9537218701 update logging config 2021-02-06 17:05:07 +01:00
jonaswinkler
a5726ae005 fix lost messages when queuing many files 2021-02-04 23:32:11 +01:00
jonaswinkler
d08a530701 don't load sklearn libraries unless needed 2021-02-04 15:15:11 +01:00
jonaswinkler
fce76d0671 Merge branch 'dev' into feature-websockets-status 2021-01-30 16:08:50 +01:00
jonaswinkler
d6d749fa54 fix redis configuration 2021-01-27 20:08:09 +01:00
jonaswinkler
f36115ee37 alter defaults for workers and threads to allow more parallel tasks #446 2021-01-26 22:10:43 +01:00
jonaswinkler
044aa55d74 Merge branch 'dev' into feature-websockets-status 2021-01-23 22:22:17 +01:00
jonaswinkler
af58998e44 configuration option for type of PDF viewer fixes #337 2021-01-14 13:35:43 +01:00
jonaswinkler
0db42def0a DEBUG logging 2021-01-13 13:35:05 +01:00
jonaswinkler
291983600b adjusted the default configuration so that at least one cpu core remains free (except on single core machines) #332 2021-01-13 00:02:20 +01:00
jonaswinkler
c710db7aab add sslmode configuration option fixes #298 2021-01-10 01:35:56 +01:00
jonaswinkler
e3e5138df4 fixes #290 2021-01-08 13:27:57 +01:00
jonaswinkler
a9419e07bc move settings 2021-01-06 16:53:58 +01:00
Jonas Winkler
499baf6694 Merge pull request #260 from shamoon/feature/remote-user
Feature: authentication via HTTP_REMOTE_USER
2021-01-06 14:27:14 +01:00
Jonas Winkler
22f45ac619 Merge pull request #251 from jayme-github/ignore-date
Add option to ignore certain dates in parse_date
2021-01-05 00:19:13 +01:00
jonaswinkler
179b53d373 Merge branch 'dev' into feature-websockets-status 2021-01-04 22:45:56 +01:00
jonaswinkler
a4161f9b97 add french to paperless 2021-01-04 14:15:34 +01:00
Michael Shamoon
0286abd92a Refactor to extend RemoteUserMiddleware & add authentication for Django 2021-01-03 21:21:39 -08:00
jonaswinkler
10afd644c3 add dutch language 2021-01-03 22:25:12 +01:00
Michael Shamoon
8caa61bd9d Merge remote-tracking branch 'upstream/dev' into feature/remote-user 2021-01-03 00:38:10 -08:00
Michael Shamoon
009de5184f Allow authentication via HTTP_REMOTE_USER 2021-01-03 00:37:19 -08:00
jayme-github
cd15490e91 Add option to ignore certain dates in parse_date
PAPERLESS_IGNORE_DATES allows to specify a comma separated list of dates
to ignore during date parsing (from filename and content). This can be
used so specify dates that do appear often in documents but are usually
not the documents creation date (like your date of birth).
2021-01-02 15:20:49 +01:00
jonaswinkler
4eb3962b8d Merge branch 'dev' into feature-localization 2021-01-02 00:15:03 +01:00
Jo Vandeginste
aa88f25267 Refactor after feedback:
- rename PAPERLESS_TIKA to PAPERLESS_TIKA_ENABLED
- all other env params now start with PAPERLESS_TIKA
- convert_to_pdf as class instance method
- smaller details

Signed-off-by: Jo Vandeginste <Jo.Vandeginste@kuleuven.be>
2020-12-31 14:41:47 +01:00
jonaswinkler
29730a9c13 add translation support to the backend #215 2020-12-30 01:39:06 +01:00
jonaswinkler
05d4ca06fe add initial localization support for the front end #215 2020-12-30 00:26:06 +01:00
Jo Vandeginste
bf8739864d Add the new paperless_tika parser
This parser will use an external Tika and Gotenberg server to parse
"Office" documents (.doc, .xls, .odt, etc.)

Signed-off-by: Jo Vandeginste <Jo.Vandeginste@kuleuven.be>
2020-12-29 21:51:21 +01:00
jonaswinkler
fe73f42495 added configuration option for the font #197 #207 2020-12-29 12:26:41 +01:00
Jo Vandeginste
3f4db5cd32 Update settings.py 2020-12-28 22:37:53 +01:00
Jo Vandeginste
c3b4f40447 Allow extending INSTALLED_APPS via environment
This allows a user to add "apps" (aka parsers) through the environment.

Especially useful when using Docker, and adding a test-parser.

Usage:

```yaml
services:
  webserver:
    environment:
      PAPERLESS_APPS: paperless_tika.apps.PaperlessTikaConfig
```

You can add more by separating them with a `,`:

```yaml
PAPERLESS_APPS: app1,app2
```
2020-12-28 22:19:30 +01:00
jonaswinkler
df9ac0cb8a removed x-frame-options, since that was only used for the <object> pdf display tag. 2020-12-19 16:46:04 +01:00
jonaswinkler
e02838b198 fixes #122 2020-12-11 17:49:32 +01:00
jonaswinkler
638113864f fixes #90 2020-12-08 13:54:49 +01:00
jonaswinkler
0bfecaa0fc Merge branch 'dev' into feature-websockets-status 2020-12-06 22:53:54 +01:00
jonaswinkler
249422570e mail handling: When exceptions occur during account/rule/message handling, paperless will continue with the next account/rule/message.
mail handling: When paperless encounters a very long

fixes #82
2020-12-04 15:42:05 +01:00
jonaswinkler
9546d6bf8c updated the API, it now supports tags, correspondents, types and title when uploading documents. 2020-12-03 18:36:23 +01:00
jonaswinkler
39e3a8a04f reorganized logging. 2020-12-03 01:03:56 +01:00
jonaswinkler
578a1626e1 documentation. 2020-12-01 15:26:22 +01:00
jonaswinkler
e528a8446e Merge branch 'dev' into feature-ocrmypdf 2020-11-30 23:53:19 +01:00
jayme-github
a90b7a647e Create tags from sub directories
The names of sub directories in the consumer directory will be added as
tags for the document to be consumed.
To enable this, set:
PAPERLESS_CONSUMER_RECURSIVE=1
PAPERLESS_CONSUMER_SUBDIRS_AS_TAGS=1

Fixes #50
2020-11-30 14:22:35 +01:00
jonaswinkler
388f6cfbe6 reorganised settings documentation and added OCR_USER_ARGS 2020-11-29 12:38:32 +01:00
jonaswinkler
7bba3065fe Merge branch 'dev' into feature-ocrmypdf 2020-11-27 14:03:19 +01:00
jonaswinkler
6cd54034d2 tests: disable db logger in all tests except logger tests 2020-11-27 13:13:11 +01:00
Jonas Winkler
3b655c95d9 added image DPI detection to the tesseract parser. 2020-11-25 19:37:48 +01:00
Jonas Winkler
8b1aa9cf68 removed unused settings. 2020-11-25 19:30:11 +01:00
Jonas Winkler
15935ab61f reworked PDF parser that uses OCRmyPDF and produces archive files. 2020-11-25 14:50:43 +01:00
Jonas Winkler
28cd246d48 added archive directory. 2020-11-25 14:45:21 +01:00
Jonas Winkler
b275297a05 added setting PAPERLESS_AUTO_LOGIN_USERNAME 2020-11-23 22:50:02 +01:00
Jonas Winkler
3893a23852 Merge branch 'dev' into celery-tasks 2020-11-22 22:49:37 +01:00
Jonas Winkler
ae198f0767 new setting: PAPERLESS_OCR_PAGES 2020-11-22 12:54:08 +01:00