568 Commits

Author SHA1 Message Date
Jonas Winkler
52ce5676df fixes the docker build. also: proper 404 for missing documents/thumbnails 2020-11-13 22:31:39 +01:00
Jonas Winkler
b195d17158 add exception handler for invalid filename formats. 2020-11-13 20:31:51 +01:00
Jonas Winkler
4734dec465 add some more checks. 2020-11-12 21:20:12 +01:00
Jonas Winkler
eb6805e37e code style fixes 2020-11-12 21:09:45 +01:00
Jonas Winkler
4fba7b661e trying to make travis work. 2020-11-12 17:12:58 +01:00
Jonas Winkler
e2270c2ba2 added some more fields for document file naming. 2020-11-12 11:12:39 +01:00
Jonas Winkler
23eae9a3f1 This is how the original filenames were generated. Keep it this way for compatibility. 2020-11-12 10:42:18 +01:00
Jonas Winkler
1fa2c54932 on_modified not needed for the consumer. 2020-11-12 10:41:47 +01:00
Jonas Winkler
1c50b7693d fixes #31 2020-11-12 10:04:01 +01:00
Jonas Winkler
f53a958bc5 fixes #30 2020-11-12 09:30:04 +01:00
Jonas Winkler
1e3e80930c enable Group and User management. 2020-11-11 20:25:33 +01:00
Jonas Winkler
5a658b7ad6 show the filename in the admin. 2020-11-11 15:58:29 +01:00
Jonas Winkler
ce7bb51df2 test database errors. 2020-11-11 14:38:41 +01:00
Jonas Winkler
8b8a2af053 fixed the file handling implementation. The feature is cool, but the original implementation had so many small flaws it wasn't even funny. 2020-11-11 14:21:33 +01:00
Jonas Winkler
a91e46364a small consumer fixes 2020-11-11 14:14:21 +01:00
Jonas Winkler
e8d4696c71 remove only automatically created schedules (almost) 2020-11-11 14:13:54 +01:00
Jonas Winkler
3048342de7 added a setting: delete duplicate documents 2020-11-10 01:47:58 +01:00
Jonas Winkler
1ddbf416d4 fixed an issue with the searcher. 2020-11-10 01:47:35 +01:00
Jonas Winkler
d3e7c8ff4e added a task scheduler for recurring tasks 2020-11-09 20:29:10 +01:00
Jonas Winkler
3b0e794b3d use django authentication instead of auth tokens. 2020-11-09 15:28:12 +01:00
Jonas Winkler
30f837d49f fixed most of the test cases 2020-11-08 13:49:15 +01:00
Jonas Winkler
548941b1bb added the filename handling back into the code 2020-11-08 13:00:45 +01:00
Jonas Winkler
edb5b1ea91 moved some code 2020-11-08 11:30:16 +01:00
Jonas Winkler
ec1f65a355 I removed the model save/delete hooks for index updates since they were causing too much trouble with migrations 2020-11-08 11:24:57 +01:00
Jonas Winkler
d31d45c2cf forgot a sort field 2020-11-08 00:42:39 +01:00
Jonas Winkler
f2ede49cba removed logging from index since it was causing issues with migrations and missing logging groups 2020-11-07 23:26:18 +01:00
Jonas Winkler
c1348d3dbf updated document upload form 2020-11-07 23:25:50 +01:00
Jonas Winkler
9ca95d7b0a combined migrations 2020-11-07 23:22:44 +01:00
Jonas Winkler
9829b3dae5 add correspondent to the index 2020-11-06 17:27:21 +01:00
Jonas Winkler
33f1c82943 updated the classifier. Its now much faster and does not retrain when data hasnt changed. 2020-11-06 14:46:06 +01:00
Jonas Winkler
9b99c41382 removed pinpoint fragmenter as it was screwing up highlighting 2020-11-04 17:26:26 +01:00
Jonas Winkler
828752142a disabled fuzzy matching since it screwed up scored results (?) 2020-11-04 01:00:21 +01:00
Jonas Winkler
7dfd6c47a7 fixes #11 and some server side warnings 2020-11-04 00:33:29 +01:00
Jonas Winkler
d4aa2fb169 fixes #17
search for multiple tags
2020-11-04 00:01:08 +01:00
Jonas Winkler
ab2644ccf0 added migration step to create initial classifier, silences consumption warnings 2020-11-03 14:10:53 +01:00
Jonas Winkler
cf5e463b9b silenced unpaper once and for all 2020-11-03 14:04:21 +01:00
Jonas Winkler
9757e261f2 A handy script to redo ocr on all documents, 2020-11-03 14:04:11 +01:00
Jonas Winkler
5686f76ef2 this was not required since saving a document updates the index anyway 2020-11-03 13:51:49 +01:00
Jonas Winkler
9a4ff3f807 replaced usages of .id with .pk, fixed filename issue in exporter 2020-11-03 12:37:37 +01:00
Jonas Winkler
64ee9a4941 updated settings: docker image runs without ENV variables 2020-11-03 12:23:24 +01:00
Jonas Winkler
157d440415 removed unused settings, migrations now match originals 2020-11-02 20:16:10 +01:00
Jonas Winkler
340f9f141f fixed most of the tests 2020-11-02 19:42:23 +01:00
Jonas Winkler
d42979842e made unpaper and convert a little bit nicer to interact with 2020-11-02 19:31:04 +01:00
Jonas Winkler
048ff1de95 fixed unnecessary assignation of correspondents / types to documents 2020-11-02 19:27:24 +01:00
Jonas Winkler
2a530accf7 renamed logger 2020-11-02 18:54:27 +01:00
Jonas Winkler
a89773ad71 removed unused code, small fixes 2020-11-02 18:20:04 +01:00
Jonas Winkler
b7521dad0f fix the migration history and provide a script to update existing paperless instances to the new version 2020-11-02 17:55:36 +01:00
Jonas Winkler
def3a85858 reworked most of the tesseract parser, better logging 2020-11-02 15:40:44 +01:00
Jonas Winkler
3d02d4b446 paginated search results 2020-11-02 12:23:50 +01:00
Jonas Winkler
c497ff4a5d logging: don't group by logging_group 2020-11-02 01:24:56 +01:00