Trenton H
|
79aecebbd2
|
In the case of an RTL language being extracted via pdfminer.six, fall back to forced OCR, which handles RTL text better
|
2022-12-29 16:02:02 -08:00 |
|
Trenton H
|
68c62f3857
|
Allows parsing of WebP format images
|
2022-11-28 09:35:54 -08:00 |
|
Trenton H
|
ffd9cd721d
|
Adds a test to cover this edge case
|
2022-11-22 07:22:41 -08:00 |
|
Trenton Holmes
|
1be8f39aa0
|
Reverts the change around skip_noarchive to align with how it is documented to work
|
2022-10-20 13:34:41 -07:00 |
|
Trenton Holmes
|
43d2545321
|
Fixes the creation of an archive file, even if noarchive was specified
|
2022-08-20 13:47:56 -07:00 |
|
Trenton Holmes
|
8660103563
|
Changes the simple-alpha parsing test to use a tempdir so the original isn't modified in Git
|
2022-07-02 16:19:22 +02:00 |
|
Trenton Holmes
|
6635fa5f0d
|
Runs the pre-commit hooks over all the Python files
|
2022-03-11 11:34:28 -08:00 |
|
kpj
|
c56cb25b5f
|
Format Python code with black
|
2022-02-27 15:26:41 +01:00 |
|
Martin Müller
|
a662ce03ea
|
Modify test for PNG image with alpha
|
2022-02-21 22:38:25 +01:00 |
|
jonaswinkler
|
c9d76322eb
|
also apply \0 removal to sidecar contents
|
2021-03-22 23:08:34 +01:00 |
|
jonaswinkler
|
3a67462396
|
fixes #631
|
2021-03-14 14:42:48 +01:00 |
|
jonaswinkler
|
81b787635e
|
update dependencies
|
2021-02-28 13:01:26 +01:00 |
|
jonaswinkler
|
96088716d9
|
tests
|
2021-02-22 00:17:16 +01:00 |
|
jonaswinkler
|
26c65b29d5
|
tests
|
2021-02-21 00:18:34 +01:00 |
|
jonaswinkler
|
94cc9876d9
|
local import of ocrmypdf so that the webserver does not load that
|
2021-02-15 12:18:10 +01:00 |
|
jonaswinkler
|
bee7a06e41
|
fix bugs and test cases
|
2021-01-02 15:37:27 +01:00 |
|
jonaswinkler
|
45d31f9735
|
fixes bauerj/paperless_app#23 and most of all other scanner apps out there.
|
2020-12-12 18:25:15 +01:00 |
|
jonaswinkler
|
1d073d2cfd
|
a couple fixes and more supported image files
|
2020-12-02 17:39:49 +01:00 |
|
jonaswinkler
|
0fb294d556
|
testing the new noarchive option.
|
2020-12-01 14:30:13 +01:00 |
|
jonaswinkler
|
20cc7e3dc0
|
more tests!
|
2020-11-29 19:58:48 +01:00 |
|
jonaswinkler
|
99e6906b51
|
test case fixes.
|
2020-11-27 14:06:37 +01:00 |
|
Jonas Winkler
|
f901def797
|
more tests of the new parser
|
2020-11-26 00:08:23 +01:00 |
|
Jonas Winkler
|
c00c63c639
|
fixed the test cases
|
2020-11-25 19:51:09 +01:00 |
|
Jonas Winkler
|
f5656222e2
|
removed obsolete tests.
|
2020-11-25 14:51:32 +01:00 |
|
Jonas Winkler
|
cbee56ae8c
|
testing the tesseract parser
|
2020-11-19 20:31:08 +01:00 |
|