1775 Commits

Author SHA1 Message Date
Jonas Winkler
1ddbf416d4 fixed an issue with the searcher. 2020-11-10 01:47:35 +01:00
Jonas Winkler
7bfe28f451 Merge branch 'dev' into celery-tasks 2020-11-10 00:16:59 +01:00
Jonas Winkler
d3e7c8ff4e added a task scheduler for recurring tasks 2020-11-09 20:29:10 +01:00
Jonas Winkler
1ca35f6fd1 fixed dev auth 2020-11-09 15:47:22 +01:00
Jonas Winkler
0f498f1016 removed no-login option since it wasn't working with the new django version anyway. 2020-11-09 15:28:45 +01:00
Jonas Winkler
3b0e794b3d use django authentication instead of auth tokens. 2020-11-09 15:28:12 +01:00
Jonas Winkler
30f837d49f fixed most of the test cases 2020-11-08 13:49:15 +01:00
Jonas Winkler
548941b1bb added the filename handling back into the code 2020-11-08 13:00:45 +01:00
Jonas Winkler
edb5b1ea91 moved some code 2020-11-08 11:30:16 +01:00
Jonas Winkler
ec1f65a355 I removed the model save/delete hooks for index updates since they were causing too much trouble with migrations 2020-11-08 11:24:57 +01:00
Jonas Winkler
d31d45c2cf forgot a sort field 2020-11-08 00:42:39 +01:00
Jonas Winkler
f2ede49cba removed logging from index since it was causing issues with migrations and missing logging groups 2020-11-07 23:26:18 +01:00
Jonas Winkler
c1348d3dbf updated document upload form 2020-11-07 23:25:50 +01:00
Jonas Winkler
9ca95d7b0a combined migrations 2020-11-07 23:22:44 +01:00
Jonas Winkler
dd5e43e7d4 fixed up the docker 2020-11-07 12:47:17 +01:00
Jonas Winkler
d46203c114 backend that supports asgi and status update sockets with channels 2020-11-07 11:31:04 +01:00
Jonas Winkler
9829b3dae5 add correspondent to the index 2020-11-06 17:27:21 +01:00
Jonas Winkler
33f1c82943 updated the classifier. It's now much faster and does not retrain when data hasn't changed. 2020-11-06 14:46:06 +01:00
Jonas Winkler
5b3dbd8e6e allow document display with angular dev server 2020-11-04 17:26:49 +01:00
Jonas Winkler
9b99c41382 removed pinpoint fragmenter as it was screwing up highlighting 2020-11-04 17:26:26 +01:00
Jonas Winkler
828752142a disabled fuzzy matching since it screwed up scored results (?) 2020-11-04 01:00:21 +01:00
Jonas Winkler
7dfd6c47a7 fixes #11 and some server side warnings 2020-11-04 00:33:29 +01:00
Jonas Winkler
d4aa2fb169 fixes #17; search for multiple tags 2020-11-04 00:01:08 +01:00
Jonas Winkler
b941cac0f2 added backward compatibility URLs 2020-11-03 14:47:42 +01:00
Jonas Winkler
ab2644ccf0 added migration step to create initial classifier, silences consumption warnings 2020-11-03 14:10:53 +01:00
Jonas Winkler
cf5e463b9b silenced unpaper once and for all 2020-11-03 14:04:21 +01:00
Jonas Winkler
9757e261f2 A handy script to redo ocr on all documents 2020-11-03 14:04:11 +01:00
Jonas Winkler
5686f76ef2 this was not required since saving a document updates the index anyway 2020-11-03 13:51:49 +01:00
Jonas Winkler
9a4ff3f807 replaced usages of .id with .pk, fixed filename issue in exporter 2020-11-03 12:37:37 +01:00
Jonas Winkler
64ee9a4941 updated settings: docker image runs without ENV variables 2020-11-03 12:23:24 +01:00
Jonas Winkler
749d152347 updated configuration 2020-11-02 21:59:36 +01:00
Jonas Winkler
157d440415 removed unused settings, migrations now match originals 2020-11-02 20:16:10 +01:00
Jonas Winkler
340f9f141f fixed most of the tests 2020-11-02 19:42:23 +01:00
Jonas Winkler
d42979842e made unpaper and convert a little bit nicer to interact with 2020-11-02 19:31:04 +01:00
Jonas Winkler
048ff1de95 fixed unnecessary assignation of correspondents / types to documents 2020-11-02 19:27:24 +01:00
Jonas Winkler
2a530accf7 renamed logger 2020-11-02 18:54:27 +01:00
Jonas Winkler
a89773ad71 removed unused code, small fixes 2020-11-02 18:20:04 +01:00
Jonas Winkler
b7521dad0f fix the migration history and provide a script to update existing paperless instances to the new version 2020-11-02 17:55:36 +01:00
Jonas Winkler
def3a85858 reworked most of the tesseract parser, better logging 2020-11-02 15:40:44 +01:00
Jonas Winkler
3d02d4b446 paginated search results 2020-11-02 12:23:50 +01:00
Jonas Winkler
972a6a2333 bugfix 2020-11-02 01:26:42 +01:00
Jonas Winkler
c497ff4a5d logging: don't group by logging_group 2020-11-02 01:24:56 +01:00
Jonas Winkler
ffdb517b73 removed settings constants 2020-11-01 23:37:56 +01:00
Jonas Winkler
6adc870a20 silenced unpaper, optipng for cleaner output; moved parser settings to settings; removed forgiving ocr (now default) since tesseract is plenty accurate even without defining the correct language. 2020-11-01 23:23:42 +01:00
Jonas Winkler
6fd73a04b8 updated consumer: now using watchdog 2020-11-01 23:07:54 +01:00
Jonas Winkler
2a4127dc90 basic statistics 2020-10-31 00:56:20 +01:00
Jonas Winkler
a5c2ed1d76 disabled session auth, breaks login 2020-10-29 23:15:59 +01:00
Jonas Winkler
933c327866 enabled session auth again, useful for the browsable rest api 2020-10-29 22:33:43 +01:00
Jonas Winkler
2c318b87a0 autocomplete ordering 2020-10-29 14:34:02 +01:00
Jonas Winkler
6ce493e3a7 the document classifier is now stateless 2020-10-29 14:33:42 +01:00