140 Commits

Author SHA1 Message Date
Jonas Winkler
b71657964b Code style changes 2018-09-26 10:51:42 +02:00
Jonas Winkler
f7a37b8530 Removed the archive tag, as it wasnt really used anyway. 2018-09-25 21:51:38 +02:00
Jonas Winkler
64815793bb Code style changes 2018-09-25 21:12:47 +02:00
Jonas Winkler
efc7bf1d23 Code style adjustments 2018-09-25 16:09:33 +02:00
Jonas Winkler
0f2a79ea61 Merge remote-tracking branch 'upstream/master' 2018-09-25 14:47:12 +02:00
Daniel Quinn
bc33b82978 Tweak the import/export system to handle encryption choices better
Now when you export a document, the `storage_type` value is always
`unencrypted` (since that's what it is when it's exported anyway), and
the flag is set by the importing script instead, based on the existence
of a `PAPERLESS_PASSPHRASE` environment variable, indicating that
encryption is enabled.
2018-09-23 13:58:40 +01:00
Jonas Winkler
20233a1706 Code style changed 2018-09-13 14:15:16 +02:00
Jonas Winkler
5390672762 Fixed a few minor issues. 2018-09-12 16:25:23 +02:00
Jonas Winkler
35ea0f2add Merge branch 'machine-learning' into dev 2018-09-11 14:36:21 +02:00
Jonas Winkler
8a16b62773 The classifier works with ids now, not names. Minor changes. 2018-09-11 14:30:18 +02:00
Jonas Winkler
d2929e974a changed classifier 2018-09-11 00:33:07 +02:00
Jonas Winkler
1c8576cfb9 mode change 2018-09-06 12:00:01 +02:00
Jonas Winkler
d725f20505 Merge branch 'dev' into machine-learning 2018-09-06 00:29:41 +02:00
Jonas Winkler
069249cc0a Merge branch 'master' into dev 2018-09-06 00:28:58 +02:00
Jonas Winkler
c701a8f59c Merge branch 'dev' into machine-learning 2018-09-05 15:26:39 +02:00
Jonas Winkler
bbba57dd4d implemented automatic classification field functionality 2018-09-05 14:31:02 +02:00
Jonas Winkler
582e9c5cb4 Fixed a few things 2018-09-05 12:43:11 +02:00
Jonas Winkler
9d4155a907 removed matching model fields, automatic classifier reloading, added autmatic_classification field to matching model 2018-09-04 18:40:26 +02:00
Jonas Winkler
8a1a794577 Document Type exporting 2018-09-04 14:55:29 +02:00
Jonas Winkler
c50c517928 Implemented the classifier model, including automatic tagging of new documents 2018-09-04 14:39:55 +02:00
Jonas Winkler
3eecd67fc1 Added code that trains models based on data from the databasae 2018-09-03 15:55:41 +02:00
Daniel Quinn
ef0b33e72e Clean up some linter complaints 2018-09-02 20:33:49 +01:00
Jonas Winkler
daa93883ee Added command to create datasets 2018-09-02 12:47:19 +02:00
Jonas Winkler
c03cfb176c inbox tags, archive tags, archive serial number for documents 2018-07-06 13:25:02 +02:00
Daniel Quinn
e7e69d3f6f Remove emoji from storage-type changer 2018-06-17 17:23:50 +01:00
Daniel Quinn
044d707c40 Update import & export to handle encryption toggle 2018-06-17 17:06:22 +01:00
Daniel Quinn
e7fefc40fe Merge branch 'master' into mcronce-disable_encryption 2018-06-17 16:32:51 +01:00
Daniel Quinn
d1b6e9329f It's exist_ok=, not exists_ok= -- my bad. 2018-05-28 13:08:00 +01:00
Daniel Quinn
4576541c28 Add script to (de|en)crypt all documents 2018-05-28 12:58:28 +01:00
Erik Arvstedt
d132e2b9f5 fixup: remove helper fn 'make_dirs' 2018-05-21 00:45:00 +02:00
Erik Arvstedt
3db175dfe2 Add inotify support 2018-05-11 14:14:50 +02:00
Erik Arvstedt
b74b47423d Consumer loop: make sleep duration dynamic
Make the sleep duration dynamic to account for the time spent in
loop_step.
This improves responsiveness when repeatedly consuming newly
arriving docs.

Use float epoch seconds (time.time()) as the time type for
MailFetcher.last_checked to allow for natural time arithmetic.
2018-05-11 14:14:50 +02:00
Erik Arvstedt
aac17670de Refactor: renamings, extract fn 'loop'
Renamings:
loop -> loop_step
delta -> next_mail_time (this variable names a point in time, not a duration)

Extracting the 'loop' fn is a preparation for later commits where a
second type of loop is added.
2018-05-11 14:14:25 +02:00
Erik Arvstedt
f56ec70aad Ensure docs have been unmodified for some time before consuming
Previously, the second mtime check for new files usually happened right
after the first one, which could have caused consumption of docs that
were still being modified.

We're now waiting for at least FILES_MIN_UNMODIFIED_DURATION (0.5s).

This also cleans up the logic by eliminating the consumer.stats attribute
and the weird double call to consumer.run().

Additionally, this a fixes memory leak in consumer.stats where paths could be
added but never removed if the corresponding files disappeared from
the consumer dir before being considered ready.
2018-05-11 14:05:29 +02:00
Erik Arvstedt
9320230100 Refactor: extract fn 'make_dirs' 2018-05-11 14:04:36 +02:00
Daniel Quinn
19209ba5af Run a --oneshot loop twice
This was necessary since the first loop only ever collects file
statistics so that the second run can be sure about "readiness".
2018-03-03 18:43:20 +00:00
Ovv
340855cd87 Help & documentation 2018-03-03 18:43:20 +00:00
Ovv
b10c2c770c style & test 2018-03-03 18:43:20 +00:00
Ovv
d89dbbe537 Configuration cli argument for document_consumer 2018-03-03 18:43:20 +00:00
Daniel Quinn
345bc97c8c Updated for style and to add a --use-first option 2018-02-08 20:03:29 +00:00
Dashie
3df9ea3b26 Fix line length 2018-02-08 19:34:48 +00:00
Dashie
73a9a23860 Add manager command to re-tag documents without correspondent 2018-02-08 19:34:48 +00:00
pzl
7a01005989 small typo in exporter thumbnail filename 2018-01-19 14:28:46 -05:00
David Martin
91cebb5567 Fetch emails right at startup instead of waiting for 10 minutes.
Especially when first setting up the configuration for consuming
documents from emails it makes sense to quickly test the changes. Having
to wait for 10 minutes is not acceptable.

There are two ways around it that come to my mind: the simple approach
is to always fetch the emails when Paperless first starts. This way the
fetching of emails can be tested straight away.
The alternative would be to have a configuration option that allows to
set the interval in which emails are checked. The user could then reduce
it to test the setup and increase it again later on. This seems
needlessly complicated though, so fetching at startup it is.
2017-05-21 14:23:46 +10:00
CkuT
cabb9b5096 Use relatives paths instead of absolutes paths for document export/import 2017-05-08 15:23:35 +02:00
CkuT
a4f389de36 Refactor to get the document time once 2017-05-08 15:02:59 +02:00
CkuT
909fa3579c Use constants for manifest 2017-05-08 14:54:48 +02:00
CkuT
4c4255172f Add thumbnail export 2017-05-06 15:14:36 +02:00
CkuT
0057feefd1 Fix the source file checking 2017-05-06 15:04:47 +02:00
Daniel Quinn
23bd887f16 Consumer loop time is now configurable 2017-01-01 18:41:06 +00:00