140 Commits

Author SHA1 Message Date
jonaswinkler
555a1a6b64 proper document archiver with progress bar. 2020-12-03 01:04:52 +01:00
jonaswinkler
e4eeb29f54 checking file types against parsers in the consumer. 2020-12-01 15:26:05 +01:00
jonaswinkler
e528a8446e Merge branch 'dev' into feature-ocrmypdf 2020-11-30 23:53:19 +01:00
jayme-github
a90b7a647e Create tags from sub directories
The names of sub directories in the consumer directory will be added as
tags for the document to be consumed.
To enable this, set:
PAPERLESS_CONSUMER_RECURSIVE=1
PAPERLESS_CONSUMER_SUBDIRS_AS_TAGS=1

Fixes #50
2020-11-30 14:22:35 +01:00
jonaswinkler
0e2a2eb53a code style. 2020-11-29 12:37:22 +01:00
jonaswinkler
96dc4c1daa added checksums for archived documents. 2020-11-29 12:31:26 +01:00
jonaswinkler
e6a36379fc added a simple document archiver that produces archived versions of all originals. 2020-11-28 11:49:07 +01:00
jonaswinkler
55e0b383c2 Adjusted the exporter and importer so that they take archived documents into account. 2020-11-28 11:24:59 +01:00
jonaswinkler
c6627eac1f fix warnings about unclosed files. 2020-11-27 13:19:58 +01:00
jonaswinkler
29867ba6bd inotify: cleanup descriptor when done 2020-11-27 13:12:34 +01:00
jonaswinkler
72b4f817df moved consumption dir check into the correct spot 2020-11-27 13:12:13 +01:00
jonaswinkler
f49bf187eb more tests. 2020-11-26 23:56:57 +01:00
jonaswinkler
9627a91654 removed some obsolete exporter code. 2020-11-26 23:09:17 +01:00
jonaswinkler
221c1e76e9 couple changes to the consumer. 2020-11-26 18:55:05 +01:00
jonaswinkler
dac7971cd6 Apparently there was a very good reason to use inotify. fixes #46 complete with test cases for inotify and polling. 2020-11-26 17:57:03 +01:00
Jonas Winkler
61d9794824 fixed the decryption code, but its still untested. 2020-11-25 21:10:50 +01:00
Jonas Winkler
3ba603a2e8 Paperless will continue to operate with encrypted files, however, all new files will be stored unencrypted. 2020-11-25 21:03:06 +01:00
Jonas Winkler
18165ccad2 Removed ability to encrypt documents. 2020-11-25 20:23:25 +01:00
Jonas Winkler
9c23207b84 workaround for a bug in django-q: task results with too long names would not show up in the result lists. 2020-11-22 13:53:19 +01:00
Jonas Winkler
afc3753e58 code cleanup 2020-11-21 14:03:45 +01:00
Jonas Winkler
f976a0b4ba mime type handling 2020-11-20 13:31:03 +01:00
Jonas Winkler
5eb5aa6fb6 removed unused code. 2020-11-18 00:54:51 +01:00
Jonas Winkler
24bb8c71c9 Merge branch 'dev' into mail_rework 2020-11-17 00:23:10 +01:00
Jonas Winkler
e30f0b274b added more testing 2020-11-16 23:16:37 +01:00
Jonas Winkler
2119eb4c15 added option for polling 2020-11-16 18:52:13 +01:00
Jonas Winkler
bd04c966c5 first version of the new consumer. 2020-11-16 18:26:54 +01:00
Jonas Winkler
d99b4623f8 first implementation of the mail rework 2020-11-15 23:56:22 +01:00
Jonas Winkler
eb6805e37e code style fixes 2020-11-12 21:09:45 +01:00
Jonas Winkler
1fa2c54932 on_modified not needed for the consumer. 2020-11-12 10:41:47 +01:00
Jonas Winkler
f53a958bc5 fixes #30 2020-11-12 09:30:04 +01:00
Jonas Winkler
8b8a2af053 fixed the file handling implementation. The feature is cool, but the original implementation had so many small flaws it wasn't even funny. 2020-11-11 14:21:33 +01:00
Jonas Winkler
d3e7c8ff4e added a task scheduler for recurring tasks 2020-11-09 20:29:10 +01:00
Jonas Winkler
548941b1bb added the filename handling back into the code 2020-11-08 13:00:45 +01:00
Jonas Winkler
33f1c82943 updated the classifier. Its now much faster and does not retrain when data hasnt changed. 2020-11-06 14:46:06 +01:00
Jonas Winkler
9757e261f2 A handy script to redo ocr on all documents, 2020-11-03 14:04:11 +01:00
Jonas Winkler
9a4ff3f807 replaced usages of .id with .pk, fixed filename issue in exporter 2020-11-03 12:37:37 +01:00
Jonas Winkler
a89773ad71 removed unused code, small fixes 2020-11-02 18:20:04 +01:00
Jonas Winkler
6fd73a04b8 updated consumer: now using watchdog 2020-11-01 23:07:54 +01:00
Jonas Winkler
6ce493e3a7 the document classifier is now stateless 2020-10-29 14:33:42 +01:00
Jonas Winkler
dd16b7262e unified document matching, legacy and automatching work alongside now 2020-10-28 11:45:11 +01:00
Jonas Winkler
ba210d3b5e changed a few things 2020-10-27 17:08:18 +01:00
Jonas Winkler
d3af1e8815 unified data folders 2020-10-26 00:35:24 +01:00
Jonas Winkler
93d963ed4e added
- document index
- api access for thumbnails/downloads
- more api filters

updated
- pipfile

removed
- filename handling
- legacy thumb/download access
- obsolete admin gui settings (per page items, FY, inline view)
2020-10-25 23:03:02 +01:00
Jonas Winkler
b71049ad16 Merge branch 'master' into dev 2020-10-16 15:02:57 +02:00
JOKer
5f8120add1 Merge pull request #593 from BastianPoe/feature-293
Give stored documents a structured and configurable filename
2020-05-02 08:33:49 +02:00
Johann Bauer
cea6dcce23 Warn if consume directory contains subdirectories
.
2020-01-04 01:09:54 +01:00
Wolf-Bastian Poettner
f65b90122e Added tool to rename all documents according to the lastest filename
format
2019-12-27 14:25:38 +00:00
Jonas Winkler
5dda05d712 Merge branch 'master' into dev 2019-05-21 13:06:16 +02:00
Dominik von Allmen
308d75f34d Update change_storage_type.py 2019-04-02 14:12:00 +02:00
domphonallmen
156282915e avoid error when decrypting files with non-ascii character 2019-04-02 11:38:00 +02:00