jonaswinkler
40f58e0b79
fix test cases
2021-01-26 15:26:25 +01:00
jonaswinkler
c0f185fe7e
bug fixes, test case fixes
2021-01-26 15:19:56 +01:00
jonaswinkler
5ac567d300
tests for pre and post consume script
2021-01-06 14:08:44 +01:00
jonaswinkler
73638b626c
fix test case
2021-01-02 15:52:02 +01:00
jonaswinkler
804e65f553
fix #236
2021-01-01 23:27:55 +01:00
jonaswinkler
755f950cd2
supply file_name for tika parser
2021-01-01 22:19:43 +01:00
jonaswinkler
b70f7ca8f3
fix test case.
2020-12-27 14:50:57 +01:00
jonaswinkler
bd0c9721c4
removed most of the logic that extracts data from filename patterns #156
2020-12-20 00:08:05 +01:00
jonaswinkler
a8747f6bc0
test CONSUMER_DELETE_DUPLICATES
2020-12-20 00:06:33 +01:00
jonaswinkler
ba7bf9b2d2
removed slugs entirely, since their only purpose was purely cosmetic anyway.
2020-12-09 00:04:37 +01:00
jonaswinkler
638113864f
fixes #90
2020-12-08 13:54:49 +01:00
jonaswinkler
655ce81cb6
changes to filename generation, partially addresses #90
2020-12-06 16:13:37 +01:00
jonaswinkler
e4eeb29f54
checking file types against parsers in the consumer.
2020-12-01 15:26:05 +01:00
jonaswinkler
e528a8446e
Merge branch 'dev' into feature-ocrmypdf
2020-11-30 23:53:19 +01:00
jonaswinkler
5af70f4dad
more testing.
2020-11-30 22:04:25 +01:00
jonaswinkler
1df64e3129
Merge branch 'dev' into feature-ocrmypdf
2020-11-30 16:48:09 +01:00
jonaswinkler
7658c07b4d
added file type checks to the parsers to prevent temporary files from being consumed. Also: parsers announce file types they wish to use as default for each mime type.
2020-11-30 00:40:04 +01:00
jonaswinkler
cb959e296a
more tests!
2020-11-29 19:22:49 +01:00
jonaswinkler
ddb3ef49f6
Merge branch 'dev' into feature-ocrmypdf
2020-11-29 18:37:38 +01:00
jonaswinkler
0e9d88ef7d
test cases for #67
2020-11-29 15:47:56 +01:00
jonaswinkler
7bba3065fe
Merge branch 'dev' into feature-ocrmypdf
2020-11-27 14:03:19 +01:00
jonaswinkler
d2e919849a
refactored the test cases to use a mixin for setting up temporary directories.
2020-11-27 14:00:41 +01:00
jonaswinkler
72b4f817df
moved consumption dir check into the correct spot
2020-11-27 13:12:13 +01:00
jonaswinkler
2b1c11503a
Pipfile.lock post merge
2020-11-27 00:10:40 +01:00
jonaswinkler
3daee6b5b5
Merge branch 'dev' into feature-ocrmypdf
2020-11-27 00:06:20 +01:00
jonaswinkler
4ab7577069
improved test cases. Python 3.6 compatibility.
2020-11-26 22:17:14 +01:00
Jonas Winkler
c00c63c639
fixed the test cases
2020-11-25 19:51:09 +01:00
Jonas Winkler
fc0ba2098a
FileType does not care about the extension anymore.
2020-11-20 16:18:59 +01:00
Jonas Winkler
f976a0b4ba
mime type handling
2020-11-20 13:31:03 +01:00
Jonas Winkler
8c40c54421
codestyle
2020-11-18 22:41:14 +01:00
Jonas Winkler
00d2dc7fef
fixed test case.
2020-11-17 18:35:45 +01:00
Jonas Winkler
39ba14aac1
refactor
2020-11-17 11:49:44 +01:00
Jonas Winkler
e30f0b274b
added more testing
2020-11-16 23:16:37 +01:00
Jonas Winkler
30f837d49f
fixed most of the test cases
2020-11-08 13:49:15 +01:00
Michael Gmelin
658e73d79a
Add unit test for PAPERLESS_FILENAME_PARSE_TRANSFORMS feature.
2019-09-08 20:58:13 +02:00
Daniel Quinn
284ff69539
Fix #384 : duplicate tags due to case insensitivity
2018-09-02 20:48:51 +01:00
Daniel Quinn
1522af2ea5
Clean up test formatting a bit
2018-04-22 16:28:21 +01:00
Daniel Quinn
fe5d55d2d1
Don't explode on invalid dates
2018-04-22 16:27:43 +01:00
Ovv
2262a6ec28
use tmp dir
2018-03-03 18:43:20 +00:00
Ovv
b10c2c770c
style & test
2018-03-03 18:43:20 +00:00
Daniel Quinn
cd21c9c451
Detect .tif files properly
...
Fixes #232
2017-07-15 19:02:11 +01:00
Daniel Quinn
a5675edd6b
fix: travis doesn't like my new tests
2017-03-28 21:23:42 +00:00
Daniel Quinn
e7d4ca92ba
fix: allow for caps in file name suffixes #206
...
@schinkelg ran aground of this one and I took the opportunity to add a
test to catch this sort of thing for next time.
2017-03-28 21:14:24 +00:00
Daniel Quinn
083ed3c7a3
fix: pep8 + travis & tox env updates
2017-03-25 15:10:51 +00:00
Daniel Quinn
d2c283582b
feat: refactor for pluggable consumers
...
I've broken out the OCR-specific code from the consumers and dumped it
all into its own app, `paperless_tesseract`. This new app should serve
as a sample of how to create one's own consumer for different file
types.
Documentation for how to do this isn't ready yet, but for the impatient:
* Create a new app
* containing a `parsers.py` for your parser modelled after
`paperless_tesseract.parsers.RasterisedDocumentParser`
* containing a `signals.py` with a handler moddelled after
`paperless_tesseract.signals.ConsumerDeclaration`
* connect the signal handler to
`documents.signals.document_consumer_declaration` in
`your_app.apps`
* Install the app into Paperless by declaring
`PAPERLESS_INSTALLED_APPS=your_app`. Additional apps should be
separated with commas.
* Restart the consumer
2017-03-25 15:10:25 +00:00
Daniel Quinn
fddd330e75
Fixed reference to wrong file
2017-01-01 16:40:29 +00:00
Daniel Quinn
6183e1ce5f
pep8
2016-11-27 15:10:07 +00:00
Daniel Quinn
18495ce9da
Fix for #154
...
* Added a test with a faked pyocr and tesseract
* Added a catch for pyocr's *other* TesseractError
2016-11-27 15:06:45 +00:00
Daniel Quinn
8e58406881
pep8 corrections
2016-10-26 09:32:59 +00:00
Aleksandr Bogdanov
63de2ca1b0
Collapsing excess whitespace after OCR
2016-10-12 01:46:34 +02:00