257 Commits

Author SHA1 Message Date
Jonas Winkler
33f1c82943 updated the classifier. Its now much faster and does not retrain when data hasnt changed. 2020-11-06 14:46:06 +01:00
Jonas Winkler
9757e261f2 A handy script to redo ocr on all documents, 2020-11-03 14:04:11 +01:00
Jonas Winkler
9a4ff3f807 replaced usages of .id with .pk, fixed filename issue in exporter 2020-11-03 12:37:37 +01:00
Jonas Winkler
a89773ad71 removed unused code, small fixes 2020-11-02 18:20:04 +01:00
Jonas Winkler
6fd73a04b8 updated consumer: now using watchdog 2020-11-01 23:07:54 +01:00
Jonas Winkler
6ce493e3a7 the document classifier is now stateless 2020-10-29 14:33:42 +01:00
Jonas Winkler
dd16b7262e unified document matching, legacy and automatching work alongside now 2020-10-28 11:45:11 +01:00
Jonas Winkler
ba210d3b5e changed a few things 2020-10-27 17:08:18 +01:00
Jonas Winkler
d3af1e8815 unified data folders 2020-10-26 00:35:24 +01:00
Jonas Winkler
93d963ed4e added
- document index
- api access for thumbnails/downloads
- more api filters

updated
- pipfile

removed
- filename handling
- legacy thumb/download access
- obsolete admin gui settings (per page items, FY, inline view)
2020-10-25 23:03:02 +01:00
Jonas Winkler
b71049ad16 Merge branch 'master' into dev 2020-10-16 15:02:57 +02:00
JOKer
5f8120add1 Merge pull request #593 from BastianPoe/feature-293
Give stored documents a structured and configurable filename
2020-05-02 08:33:49 +02:00
Johann Bauer
cea6dcce23 Warn if consume directory contains subdirectories
.
2020-01-04 01:09:54 +01:00
Wolf-Bastian Poettner
f65b90122e Added tool to rename all documents according to the lastest filename
format
2019-12-27 14:25:38 +00:00
Jonas Winkler
5dda05d712 Merge branch 'master' into dev 2019-05-21 13:06:16 +02:00
Dominik von Allmen
308d75f34d Update change_storage_type.py 2019-04-02 14:12:00 +02:00
domphonallmen
156282915e avoid error when decrypting files with non-ascii character 2019-04-02 11:38:00 +02:00
Jonas Winkler
b71657964b Code style changes 2018-09-26 10:51:42 +02:00
Jonas Winkler
f7a37b8530 Removed the archive tag, as it wasnt really used anyway. 2018-09-25 21:51:38 +02:00
Jonas Winkler
64815793bb Code style changes 2018-09-25 21:12:47 +02:00
Jonas Winkler
efc7bf1d23 Code style adjustments 2018-09-25 16:09:33 +02:00
Jonas Winkler
0f2a79ea61 Merge remote-tracking branch 'upstream/master' 2018-09-25 14:47:12 +02:00
Daniel Quinn
bc33b82978 Tweak the import/export system to handle encryption choices better
Now when you export a document, the `storage_type` value is always
`unencrypted` (since that's what it is when it's exported anyway), and
the flag is set by the importing script instead, based on the existence
of a `PAPERLESS_PASSPHRASE` environment variable, indicating that
encryption is enabled.
2018-09-23 13:58:40 +01:00
Jonas Winkler
20233a1706 Code style changed 2018-09-13 14:15:16 +02:00
Jonas Winkler
5390672762 Fixed a few minor issues. 2018-09-12 16:25:23 +02:00
Jonas Winkler
35ea0f2add Merge branch 'machine-learning' into dev 2018-09-11 14:36:21 +02:00
Jonas Winkler
8a16b62773 The classifier works with ids now, not names. Minor changes. 2018-09-11 14:30:18 +02:00
Jonas Winkler
d2929e974a changed classifier 2018-09-11 00:33:07 +02:00
Jonas Winkler
1c8576cfb9 mode change 2018-09-06 12:00:01 +02:00
Jonas Winkler
d725f20505 Merge branch 'dev' into machine-learning 2018-09-06 00:29:41 +02:00
Jonas Winkler
069249cc0a Merge branch 'master' into dev 2018-09-06 00:28:58 +02:00
Jonas Winkler
c701a8f59c Merge branch 'dev' into machine-learning 2018-09-05 15:26:39 +02:00
Jonas Winkler
bbba57dd4d implemented automatic classification field functionality 2018-09-05 14:31:02 +02:00
Jonas Winkler
582e9c5cb4 Fixed a few things 2018-09-05 12:43:11 +02:00
Jonas Winkler
9d4155a907 removed matching model fields, automatic classifier reloading, added autmatic_classification field to matching model 2018-09-04 18:40:26 +02:00
Jonas Winkler
8a1a794577 Document Type exporting 2018-09-04 14:55:29 +02:00
Jonas Winkler
c50c517928 Implemented the classifier model, including automatic tagging of new documents 2018-09-04 14:39:55 +02:00
Jonas Winkler
3eecd67fc1 Added code that trains models based on data from the databasae 2018-09-03 15:55:41 +02:00
Daniel Quinn
ef0b33e72e Clean up some linter complaints 2018-09-02 20:33:49 +01:00
Jonas Winkler
daa93883ee Added command to create datasets 2018-09-02 12:47:19 +02:00
Jonas Winkler
c03cfb176c inbox tags, archive tags, archive serial number for documents 2018-07-06 13:25:02 +02:00
Daniel Quinn
e7e69d3f6f Remove emoji from storage-type changer 2018-06-17 17:23:50 +01:00
Daniel Quinn
044d707c40 Update import & export to handle encryption toggle 2018-06-17 17:06:22 +01:00
Daniel Quinn
e7fefc40fe Merge branch 'master' into mcronce-disable_encryption 2018-06-17 16:32:51 +01:00
Daniel Quinn
d1b6e9329f It's exist_ok=, not exists_ok= -- my bad. 2018-05-28 13:08:00 +01:00
Daniel Quinn
4576541c28 Add script to (de|en)crypt all documents 2018-05-28 12:58:28 +01:00
Erik Arvstedt
d132e2b9f5 fixup: remove helper fn 'make_dirs' 2018-05-21 00:45:00 +02:00
Erik Arvstedt
3db175dfe2 Add inotify support 2018-05-11 14:14:50 +02:00
Erik Arvstedt
b74b47423d Consumer loop: make sleep duration dynamic
Make the sleep duration dynamic to account for the time spent in
loop_step.
This improves responsiveness when repeatedly consuming newly
arriving docs.

Use float epoch seconds (time.time()) as the time type for
MailFetcher.last_checked to allow for natural time arithmetic.
2018-05-11 14:14:50 +02:00
Erik Arvstedt
aac17670de Refactor: renamings, extract fn 'loop'
Renamings:
loop -> loop_step
delta -> next_mail_time (this variable names a point in time, not a duration)

Extracting the 'loop' fn is a preparation for later commits where a
second type of loop is added.
2018-05-11 14:14:25 +02:00