Jonas Winkler
0dc3644cc1
Added missing dependencies
2018-09-12 17:43:13 +02:00
Jonas Winkler
7c589f71a4
Fixed a few minor issues.
2018-09-12 16:25:23 +02:00
Jonas Winkler
25a6aa909b
removed duplicate code
2018-09-12 13:43:28 +02:00
Jonas Winkler
ef0d37985b
Merge branch 'master' into dev
2018-09-12 11:47:35 +02:00
Jonas Winkler
898931cc03
bugfix
2018-09-11 20:45:36 +02:00
Jonas Winkler
17803e7936
fixed settings
2018-09-11 17:30:46 +02:00
Jonas Winkler
e72735c4f0
Merge remote-tracking branch 'upstream/master'
2018-09-11 14:43:59 +02:00
Jonas Winkler
46a5bc00d7
Merge branch 'machine-learning' into dev
2018-09-11 14:36:21 +02:00
Jonas Winkler
d46ee11143
The classifier works with ids now, not names. Minor changes.
2018-09-11 14:30:18 +02:00
Jonas Winkler
d2534a73e5
changed classifier
2018-09-11 00:33:07 +02:00
Daniel Quinn
2edf65dd1e
Bump to 2.3.0
2.3.0
2018-09-09 21:51:44 +01:00
Daniel Quinn
9a739bdbab
Merge pull request #401 from ahyear/patch-1
add migrate command to docker update process
2018-09-09 21:26:56 +01:00
Daniel Quinn
66db06590d
Merge branch 'jat255-ENH_config_inline_or_attach'
2018-09-09 21:22:42 +01:00
Daniel Quinn
7cef108785
Streamline how we handle boolean values in settings.py
2018-09-09 21:22:07 +01:00
Daniel Quinn
a86a20ef0f
Make the example file contain the default value
2018-09-09 21:16:53 +01:00
Daniel Quinn
f94347abc0
Merge branch 'ENH_config_inline_or_attach' of git://github.com/jat255/paperless into jat255-ENH_config_inline_or_attach
2018-09-09 21:15:14 +01:00
Daniel Quinn
46cbd10ba0
Merge pull request #399 from jat255/ENH_convert_only_one_page
Speed up thumbnail generation for PDFs
2018-09-09 21:12:42 +01:00
Daniel Quinn
2a96c648e8
Merge pull request #396 from dubit0/postgres_mysql_fix
Fix document checks with PostgreSQL and MySQL backends.
2018-09-09 21:10:36 +01:00
Daniel Quinn
75648cc74b
Merge branch 'jat255-ENH_text_consumer'
2018-09-09 21:03:58 +01:00
Daniel Quinn
0472fe4e9e
Reorder imports
2018-09-09 21:03:37 +01:00
Daniel Quinn
c99f5923d5
Rename parsers to DATE_REGEX
In moving the `parsers` variable into the package-level, it lost the
context, so a more descriptive name was needed.
2018-09-09 21:02:30 +01:00
Daniel Quinn
ef302abed7
Fix pycodestyle complaints
2018-09-09 20:55:37 +01:00
Daniel Quinn
2dc35cc856
Merge branch 'ENH_text_consumer' of git://github.com/jat255/paperless into jat255-ENH_text_consumer
2018-09-09 20:52:59 +01:00
Daniel Quinn
f4c399f0dd
Merge pull request #398 from ddddavidmartin/bump_pyocr_version_for_tesseract_4_support
Bump required version for Pyocr to support the latest tesseract 4.
2018-09-09 20:01:51 +01:00
Daniel Quinn
5342db6ada
Fix pycodestyle complaints
Apparently, pycodestyle updated itself to check for invalid escape
sequences, and it only complains if the regex in use isn't a raw string
(r"").
2018-09-09 20:00:12 +01:00
Daniel Quinn
5c39fff51b
Add tox to dev dependencies
2018-09-09 19:59:47 +01:00
ahyear
ed0e40d3e6
add migrate command to docker update process
2018-09-06 15:32:41 +02:00
Jonas Winkler
11adc94e5e
mode change
2018-09-06 12:00:01 +02:00
Jonas Winkler
04bf5fc094
fixed merge error
2018-09-06 10:15:15 +02:00
Joshua Taillon
652ead2f5c
remove debugging print statement
2018-09-05 23:05:37 -04:00
Joshua Taillon
be9757894a
add INLINE_DOC to settings.py
2018-09-05 23:03:30 -04:00
Joshua Taillon
22378789e2
add option for inline vs. attachment for document rendering
2018-09-05 22:58:38 -04:00
Joshua Taillon
72c828170e
move date-matching regex pattern to base parser module for use by all subclasses
2018-09-05 21:13:36 -04:00
Jonas Winkler
d26f940a91
Merge branch 'dev' into machine-learning
2018-09-06 00:29:41 +02:00
Jonas Winkler
13725ef8ee
Merge branch 'master' into dev
2018-09-06 00:28:58 +02:00
Jonas Winkler
6f0ca432c4
Added scikit-learn to requirements
2018-09-06 00:20:44 +02:00
Joshua Taillon
cac63494f0
change tesseract parser to only convert first page to save (potentially) massive amounts of work
2018-09-05 15:18:35 -04:00
Jonas Winkler
dd8746bac7
fixed the api
2018-09-05 15:29:05 +02:00
Jonas Winkler
8eeded95c4
Merge branch 'dev' into machine-learning
2018-09-05 15:26:39 +02:00
Jonas Winkler
131e1c9dd8
fixed the api
2018-09-05 15:25:14 +02:00
Jonas Winkler
a6b4fc7e81
fixed api
2018-09-05 14:57:37 +02:00
Jonas Winkler
cea880f245
implemented automatic classification field functionality
2018-09-05 14:31:02 +02:00
Jonas Winkler
82bc0e3368
Fixed a few things
2018-09-05 12:43:11 +02:00
Daniel Quinn
939a67bd4b
Add empty requirements for rtd to reference
2018-09-05 11:16:42 +01:00
Daniel Quinn
fbc6a58f5a
Add credits for 2.2.0 that I forgot
2018-09-05 10:59:06 +01:00
Daniel Quinn
01a358d2b0
Re-flow text to keep it <80c wide
2018-09-05 10:58:41 +01:00
David Martin
6b447628ed
Bump required version for Pyocr to support the latest tesseract 4.
This recently changed in the official tesseract engine [0]. -psm is
not allowed as an option anymore and --psm has to be used instead. The
latest pyocr enables support for this [1].
[0] tesseract-ocr/tesseract@ee201e1
[1] 5abd0a566a
2018-09-05 13:03:42 +10:00
Thomas Niederprüm
2308d5a613
Catch ProgrammingError in Document checks.
When running PostgreSQL or MariaDB/MySQL backends, a query to a non-existent
table will raise a "ProgrammingError". This patch properly catches this error.
Without this patch all management calls to manage.py will lead to an error when
running PostgreSQL or MariaDB as a backend.
2018-09-04 20:11:48 +02:00
Jonas Winkler
70bd05450a
removed matching model fields, automatic classifier reloading, added autmatic_classification field to matching model
2018-09-04 18:40:26 +02:00
Jonas Winkler
c765ef5eeb
Merge remote-tracking branch 'upstream/master'
2018-09-04 16:02:48 +02:00