32 Commits

Author SHA1 Message Date
Jonas Winkler
41650f20f4 mime type handling 2020-11-20 13:31:03 +01:00
Jonas Winkler
727f86c369 codestyle 2020-11-18 22:41:14 +01:00
Jonas Winkler
bd322a0ce6 fixed test case. 2020-11-17 18:35:45 +01:00
Jonas Winkler
c7c6be42be refactor 2020-11-17 11:49:44 +01:00
Jonas Winkler
70d8e8bc56 added more testing 2020-11-16 23:16:37 +01:00
Jonas Winkler
6d14e111b6 fixed most of the test cases 2020-11-08 13:49:15 +01:00
Michael Gmelin
4f85d9ed9f Add unit test for PAPERLESS_FILENAME_PARSE_TRANSFORMS feature. 2019-09-08 20:58:13 +02:00
Daniel Quinn
f1e1bb4deb Fix #384: duplicate tags due to case insensitivity 2018-09-02 20:48:51 +01:00
Daniel Quinn
20a4a66a57 Clean up test formatting a bit 2018-04-22 16:28:21 +01:00
Daniel Quinn
7223ea3c3f Don't explode on invalid dates 2018-04-22 16:27:43 +01:00
Ovv
f8c6c07bb7 use tmp dir 2018-03-03 18:43:20 +00:00
Ovv
8fefafb844 style & test 2018-03-03 18:43:20 +00:00
Daniel Quinn
ede274386b Detect .tif files properly
Fixes #232
2017-07-15 19:02:11 +01:00
Daniel Quinn
6af58203dd fix: travis doesn't like my new tests 2017-03-28 21:23:42 +00:00
Daniel Quinn
fa4924d5ba fix: allow for caps in file name suffixes #206
@schinkelg ran aground of this one and I took the opportunity to add a
test to catch this sort of thing for next time.
2017-03-28 21:14:24 +00:00
Daniel Quinn
7611c2b3d5 fix: pep8 + travis & tox env updates 2017-03-25 15:10:51 +00:00
Daniel Quinn
55e81ca4bb feat: refactor for pluggable consumers
I've broken out the OCR-specific code from the consumers and dumped it
all into its own app, `paperless_tesseract`.  This new app should serve
as a sample of how to create one's own consumer for different file
types.

Documentation for how to do this isn't ready yet, but for the impatient:

* Create a new app
    * containing a `parsers.py` for your parser modelled after
      `paperless_tesseract.parsers.RasterisedDocumentParser`
    * containing a `signals.py` with a handler moddelled after
      `paperless_tesseract.signals.ConsumerDeclaration`
    * connect the signal handler to
      `documents.signals.document_consumer_declaration` in
      `your_app.apps`
* Install the app into Paperless by declaring
  `PAPERLESS_INSTALLED_APPS=your_app`.  Additional apps should be
  separated with commas.
* Restart the consumer
2017-03-25 15:10:25 +00:00
Daniel Quinn
fddd330e75 Fixed reference to wrong file 2017-01-01 16:40:29 +00:00
Daniel Quinn
6183e1ce5f pep8 2016-11-27 15:10:07 +00:00
Daniel Quinn
18495ce9da Fix for #154
* Added a test with a faked pyocr and tesseract
* Added a catch for pyocr's *other* TesseractError
2016-11-27 15:06:45 +00:00
Daniel Quinn
8e58406881 pep8 corrections 2016-10-26 09:32:59 +00:00
Aleksandr Bogdanov
63de2ca1b0 Collapsing excess whitespace after OCR 2016-10-12 01:46:34 +02:00
Daniel Quinn
1ce76a5486 Actually write the date found in the file name 2016-08-20 18:11:51 +01:00
Daniel Quinn
f5daded930 Fix for #131: delete files on document.delete 2016-08-16 19:13:37 +01:00
Daniel Quinn
0aa0513004 Modifications for support for dates 2016-03-24 19:18:33 +00:00
Tikitu de Jager
a22f088e28 Add some failing edge case tests 2016-03-07 21:48:47 +02:00
Tikitu de Jager
8afdcabca8 Template-based tests of combinations of valid elements 2016-03-07 21:47:18 +02:00
Tikitu de Jager
ad07eec3e1 Make tests pass 2016-03-07 21:37:18 +02:00
Daniel Quinn
1aecb1e63a Compensate for case and format of jpg vs. jpeg 2016-02-23 20:15:13 +00:00
Daniel Quinn
422ae9303a pep8 2016-02-21 00:14:50 +00:00
Daniel Quinn
a022fcb8f1 Fixed the auto-naming regexes 2016-02-11 22:05:55 +00:00
Daniel Quinn
ef1639208c Tests for the consumer 2016-02-11 12:25:23 +00:00