274 Commits

Author SHA1 Message Date
Daniel Quinn
1ef119eb82 Added basic filtering + search 2016-03-09 01:05:46 +00:00
Pit Kleyersburg
fb36a49c26 Add unpaper as another pre-processing step 2016-03-06 15:30:37 +01:00
Daniel Quinn
0aead1fbe6 #68: Using dotenv for a proper unix config file 2016-03-03 17:59:27 +00:00
Daniel Quinn
857c7ac654 #44: Harmonise environment variables with constant names 2016-03-01 22:39:40 +00:00
Daniel Quinn
26c3781350 #44: Harmonise environment variables with constant names 2016-03-01 22:37:42 +00:00
Daniel Quinn
2fe9b0cbc1 New logging appears to work 2016-02-27 20:18:50 +00:00
Daniel Quinn
55a7dc2444 pep8 2016-02-21 01:43:48 +00:00
Daniel Quinn
809fb8fa1f Moved the default GNUPG home to /tmp for tox-friendliness 2016-02-21 00:29:59 +00:00
Daniel Quinn
cebc44f2c9 API is halfway there 2016-02-16 09:28:34 +00:00
Daniel Quinn
7843ea5037 Added and implemented a rudimentary logger 2016-02-14 16:09:52 +00:00
Pit Kleyersburg
f5beda9c56 Enable parallel OCR processing
At the moment, every page in a PDF will be processed one by one using
tesseract. Since the processing of a single page is independent from every
other page, one can make use of multi-core machines.

This PR introduces a multiprocessing pool to process multiple pages
simultaneously. The number of threads to use can be specified in the
environment variable `PAPERLESS_OCR_THREADS`. This will default to the
number of cores/hyperthreads Python detects for your system.
2016-02-14 15:57:42 +01:00
Daniel Quinn
0eaed36420 The 'API' is written but untested 2016-02-08 23:46:16 +00:00
Daniel Quinn
d8ad6b589b Added pytest and broke up the consumer into file and mail 2016-02-05 00:23:36 +00:00
Daniel Quinn
a70b40f618 Broke the consumer script into separate files and started on a mail consumer 2016-01-30 01:18:52 +00:00
Daniel Quinn
fbbaf9cce0 Organised and documented project settings 2016-01-23 02:28:39 +00:00
the01
4c1ff658d2 add language setting for tesseract 2016-01-21 09:24:13 +01:00
Daniel Quinn
08270b1aaa Having debug off is too much of a pain in the ass 2016-01-17 02:10:07 +00:00
Daniel Quinn
d89349f272 No need for the debug flag by default 2016-01-15 18:19:53 +00:00
Daniel Quinn
17615d43cb Fixed a few consumer bugs and added an exporter
Rename exporter to export and fix some debugging

Account for files not matching the sender/title pattern

Added a safety note

Wrong regex on the name parser

Renamed the command to something slightly less ambiguous
2016-01-15 18:14:42 +00:00
Daniel Quinn
97e9cc37a9 Set CONSUMPTION_DIR to something user-editable 2016-01-10 23:17:12 +00:00
Daniel Quinn
f620762456 Safer place for the database 2016-01-10 22:45:15 +00:00
Daniel Quinn
fe47f9f07e Better handling of the passphrase 2016-01-10 13:40:26 +00:00
Daniel Quinn
f72c515742 Added GPG encryption for the PDFs 2016-01-01 16:13:59 +00:00
Daniel Quinn
855ee64097 It works! 2015-12-20 19:23:33 +00:00
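
The message on commit f5beda9c56 describes processing the pages of a PDF in parallel with a multiprocessing pool, sized by `PAPERLESS_OCR_THREADS` and defaulting to the detected core count. A minimal sketch of that idea follows; it is not the code from the commit. The names `resolve_worker_count`, `ocr_page`, and `ocr_document` are hypothetical, and `ocr_page` is a stand-in that does not call tesseract.

```python
import os
from multiprocessing import Pool


def resolve_worker_count(environ=os.environ):
    """Read PAPERLESS_OCR_THREADS, falling back to the detected core count."""
    raw = environ.get("PAPERLESS_OCR_THREADS")
    if raw:
        # Never go below one worker, even if the variable is set to 0.
        return max(1, int(raw))
    # os.cpu_count() can return None on some platforms.
    return os.cpu_count() or 1


def ocr_page(page_path):
    # Hypothetical stand-in for the per-page tesseract call.
    return "text of {}".format(page_path)


def ocr_document(page_paths):
    # Pages are independent, so each one can go to a separate worker.
    # Pool.map returns results in the same order as the input pages.
    with Pool(processes=resolve_worker_count()) as pool:
        return pool.map(ocr_page, page_paths)


if __name__ == "__main__":
    print(ocr_document(["page-1.pnm", "page-2.pnm"]))
```

Although the variable name says "threads", a multiprocessing pool starts separate worker processes, which is what lets CPU-bound OCR work use several cores.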