Trenton H
d52fbbb040
More smoothly handle the case of a password protected PDF for barcodes
2022-10-24 13:16:14 -07:00
Trenton H
f8ce6285df
Allows using pdf2image instead of pikepdf if desired
2022-10-24 09:58:34 -07:00
Trenton H
a72cc5da83
Connects up the celery signals to support pending, started and success/failure, without relying on django-celery-results
2022-10-24 09:10:10 -07:00
Michael Shamoon
8be6c707de
Update django.po
...
[ci skip]
2022-10-20 15:33:16 -07:00
Michael Shamoon
60f76d3e1f
rename backend Arabic translation file
...
[ci skip]
2022-10-20 15:31:28 -07:00
Trenton Holmes
d1aa08850d
Reverts the change around skip_noarchive to align with how it is documented to work
2022-10-20 13:34:41 -07:00
Trenton Holmes
4cc2976614
Adds specific handling for CCITT Group 4, which pikepdf decodes, but not correctly
2022-10-11 13:51:14 -07:00
Trenton H
caf4b54bc7
In case pikepdf fails to convert an image to a PIL image, fall back to converting pages to PIL images
2022-10-11 13:51:13 -07:00
Trenton H
8025df5fe3
Catch the new error raised by redis when it can't find the broker and stub out the call for testing
2022-10-10 14:21:42 -07:00
Trenton H
5aeb656a48
Fixes usage of a depracated logger method
2022-10-10 14:20:19 -07:00
Trenton H
d1a17480ea
Account for plusses in the OCR language setting
2022-10-10 08:58:23 -07:00
Trenton H
1e891414a3
Allows disabling NLTK, adds it as a consideration for low power devices
2022-10-10 08:58:23 -07:00
Trenton Holmes
c44c914d3d
Changes the NLTK language to be based on the Tesseract OCR language, with fallback to the default processing
2022-10-10 08:58:23 -07:00
Trenton H
d10d2f5a54
Allows configuration of the NLTK processing language
2022-10-10 08:58:23 -07:00
Trenton Holmes
6523cf0c4b
Fixes the download and usage of the downloaded data
2022-10-10 08:58:23 -07:00
Trenton Holmes
1262c121f0
Missed one mock
2022-10-10 08:58:23 -07:00
Trenton Holmes
f7cd6974c5
Mock out the nltk portions so the data doesn't need to be downloaded
2022-10-10 08:58:23 -07:00
Trenton Holmes
d856e48045
Updates the pre-processing of document content to be much more robust, with tokenization, stemming and stop word removal
2022-10-10 08:58:23 -07:00
shamoon
6f50285f47
Merge pull request #1648 from paperless-ngx/feature-use-celery
...
Feature: Transition to celery for background tasks
2022-10-10 00:07:55 -07:00
Trenton Holmes
77b3aa5011
Fixes is_relative_to not being availible for 3.8
2022-10-09 17:43:58 -07:00
Trenton Holmes
9aefff38e7
If the original file containing a barcode was in the temporary scratch dir, move the split files to consume dir
2022-10-09 17:43:58 -07:00
Trenton H
97ceb1a8a6
Enable some testing against a real email server to hopefully catch things earlier
2022-10-07 18:28:11 -07:00
Trenton H
55089aab32
Fixes handling of gmail label extension to IMAP
2022-10-07 18:28:11 -07:00
Trenton Holmes
9c0c734b34
Enables some basic live testing against a tika server with actual sample documents to catch some more errors mocking won't catch
2022-10-07 18:06:06 -07:00
shamoon
5357775d42
Merge pull request #1692 from paperless-ngx/feature-frontend-update-checking
...
Feature: frontend update checking settings
2022-10-05 13:46:32 -07:00
Michael Shamoon
c42388f7e2
Use text mime type for csv files for browser preview
...
Co-Authored-By: Trenton H <797416+stumpylog@users.noreply.github.com>
Co-Authored-By: bin101 <12427722+bin101@users.noreply.github.com>
2022-10-04 13:01:06 -07:00
Trenton H
ff7d4d15cd
Fixes migration error if some tasks are defined already
2022-10-04 07:56:40 -07:00
shamoon
5e4a9311ed
Merge branch 'dev' into feature-use-celery
2022-10-03 18:00:54 -07:00
Trenton H
19d4b85961
Fixes up some issues with the migrations and type mismatches
2022-10-03 13:18:25 -07:00
Jens van Almsick
ad6ef7314b
fix: csv recognition by consumer
...
paperless-ngx detects the file format via the mime-type based on the response of python-magic which rely on the response of the file command.
In version 5.39 (which is shipped with debian bullseye and I think many more non-rolling distributions) of the file command a *.csv will be detected as "application/csv" instead of "text/csv" as in newer versions.
2022-10-02 16:09:07 -07:00
Michael Shamoon
11ad8ada79
add id to document duplicate error message
2022-10-02 10:27:45 -07:00
Trenton Holmes
905b28c1d7
When a document is a duplicate, include the title of the existing document in the fail message
2022-10-02 10:27:45 -07:00
Michael Shamoon
f26fda9485
Fix python + frontend tests
2022-09-30 18:32:21 -07:00
Michael Shamoon
c87f60c605
Better migration of update checking settings, offer reload, strip backend_setting from db
2022-09-30 14:03:59 -07:00
Michael Shamoon
9e2430da46
Frontend update checking settings
2022-09-30 12:30:23 -07:00
Trenton H
436f9e891e
Changes MariaDB encoding to use utf8mb4
2022-09-29 13:53:44 -07:00
Trenton H
4422bb3f69
Fix logger location tag
...
Co-authored-by: shamoon <4887959+shamoon@users.noreply.github.com>
2022-09-28 11:02:34 -07:00
Trenton H
5b66ef0a74
Updates how task_args and task_kwargs are parsed, adds testing to cover everything I can think of
2022-09-28 10:40:55 -07:00
Michael Shamoon
4fe37f6aee
Add related_document and direct link from task UI
2022-09-27 20:50:26 -07:00
Michael Shamoon
5162bdd404
Filter out old migrated tasks
2022-09-27 19:41:23 -07:00
Michael Shamoon
c8f252d165
Add document name & error result parsing to PaperlessTask serializer
2022-09-27 19:40:24 -07:00
Trenton H
14b6216b49
Ensures all existing one to one fields are nulled before altering the field
2022-09-27 14:17:42 -07:00
Trenton H
9188e25dc5
Fixes migration order back to the right way
2022-09-27 13:55:31 -07:00
Trenton H
fad1b03458
Finalizes what the PaperlessTask will look like to the frontend
2022-09-27 12:44:01 -07:00
Michael Shamoon
9d117ee11b
Merge pull request #1666 from paperless-ngx/fix/1664
2022-09-27 09:34:34 -07:00
Michael Shamoon
5bb1824613
Allow PAPERLESS_OCR_CLEAN=none
2022-09-27 08:48:04 -07:00
Trenton H
8c07b76e6a
Bumps version numbers to 1.9.2
2022-09-27 08:06:35 -07:00
Trenton Holmes
9247300230
Transitions the backend to celery and celery beat
2022-09-26 11:25:34 -07:00
Trenton H
8967f07c8d
Fixes a missing option for OCR mode and incorrect clean mode
2022-09-26 11:05:19 -07:00
Michael Shamoon
7d4ce40a37
v1.9.0
2022-09-26 07:54:10 -07:00