Trenton Holmes
27d1d790f9
Try waiting a little bit after a parser error during the live testing
2022-11-02 15:55:12 -07:00
Trenton Holmes
59ea37f09c
No need for an extra import, the object is smart already
2022-11-01 08:44:30 -07:00
Trenton Holmes
f8c3f12146
Adds more options for the filename formatting
2022-11-01 08:44:30 -07:00
Max Bachmann
4a061c38d7
directly use rapidfuzz
2022-10-31 13:17:10 -07:00
Sblop
91a1d8f5ae
Update settings.py
...
Comment too long.
2022-10-28 14:40:48 -07:00
Sblop
fcc9be619a
Update settings.py
2022-10-28 14:40:48 -07:00
Sblop
a214b7a861
Update settings.py
...
Django raises a system error on MariaDB for VARCHAR columns longer than 255 characters. This was a limitation in older versions of MySQL.
Meaning: you cannot run Paperless-ngx on older versions where this limitation is present, so Django plays it extremely safe by raising an error.
This change fixes the problem.
2022-10-28 14:40:48 -07:00
Paperless-ngx Translation Bot [bot]
dcd909e71a
New translations django.po (Dutch)
...
[ci skip]
2022-10-27 02:51:17 -07:00
Trenton H
1e1f0347fa
More smoothly handle the case of a password protected PDF for barcodes
2022-10-24 13:16:14 -07:00
Trenton H
6d2851c693
Allows using pdf2image instead of pikepdf if desired
2022-10-24 09:58:34 -07:00
Trenton H
20b7287dc2
Connects up the celery signals to support pending, started and success/failure, without relying on django-celery-results
2022-10-24 09:10:10 -07:00
Paperless-ngx Translation Bot [bot]
11e04a32a8
New translations django.po (Arabic)
...
[ci skip]
2022-10-20 16:30:32 -07:00
Michael Shamoon
a44dc23979
Update django.po
...
[ci skip]
2022-10-20 15:33:16 -07:00
Michael Shamoon
54e9e60dd3
rename backend Arabic translation file
...
[ci skip]
2022-10-20 15:31:28 -07:00
Paperless-ngx Translation Bot [bot]
6884de3c33
New translations django.po (Arabic)
...
[ci skip]
2022-10-20 15:28:49 -07:00
Trenton Holmes
1be8f39aa0
Reverts the change around skip_noarchive to align with how it is documented to work
2022-10-20 13:34:41 -07:00
Paperless-ngx Translation Bot [bot]
5f1492f900
New translations django.po (Czech)
...
[ci skip]
2022-10-18 13:06:11 -07:00
Paperless-ngx Translation Bot [bot]
33a7177867
New translations django.po (Belarusian)
...
[ci skip]
2022-10-16 06:46:10 -07:00
Paperless-ngx Translation Bot [bot]
5d048bc569
New translations django.po (Belarusian)
...
[ci skip]
2022-10-16 05:44:52 -07:00
Trenton Holmes
ddef90d96e
Adds specific handling for CCITT Group 4, which pikepdf decodes, but not correctly
2022-10-11 13:51:14 -07:00
Trenton H
c888b3dfd3
In case pikepdf fails to convert an image to a PIL image, fall back to converting pages to PIL images
2022-10-11 13:51:13 -07:00
Trenton H
0c08b16402
Catch the new error raised by redis when it can't find the broker and stub out the call for testing
2022-10-10 14:21:42 -07:00
Trenton H
4994df2e3c
Fixes usage of a deprecated logger method
2022-10-10 14:20:19 -07:00
Trenton H
e88d911984
Account for plusses in the OCR language setting
2022-10-10 08:58:23 -07:00
Trenton H
2d71415ede
Allows disabling NLTK, adds it as a consideration for low power devices
2022-10-10 08:58:23 -07:00
Trenton Holmes
a78d44ec5f
Changes the NLTK language to be based on the Tesseract OCR language, with fallback to the default processing
2022-10-10 08:58:23 -07:00
Trenton H
0bc13c2a72
Allows configuration of the NLTK processing language
2022-10-10 08:58:23 -07:00
Trenton Holmes
70b1988a55
Fixes the download and usage of the downloaded data
2022-10-10 08:58:23 -07:00
Trenton Holmes
3c12f13df2
Missed one mock
2022-10-10 08:58:23 -07:00
Trenton Holmes
d334eec321
Mock out the nltk portions so the data doesn't need to be downloaded
2022-10-10 08:58:23 -07:00
Trenton Holmes
66884ea035
Updates the pre-processing of document content to be much more robust, with tokenization, stemming and stop word removal
2022-10-10 08:58:23 -07:00
shamoon
3b1f5668f5
Merge pull request #1648 from paperless-ngx/feature-use-celery
...
Feature: Transition to celery for background tasks
2022-10-10 00:07:55 -07:00
Trenton Holmes
397179ebe9
Fixes is_relative_to not being available in Python 3.8
2022-10-09 17:43:58 -07:00
Trenton Holmes
3b5a7fbd65
If the original file containing a barcode was in the temporary scratch dir, move the split files to the consume dir
2022-10-09 17:43:58 -07:00
Trenton H
406dae1fca
Enable some testing against a real email server to hopefully catch things earlier
2022-10-07 18:28:11 -07:00
Trenton H
6c269391d5
Fixes handling of gmail label extension to IMAP
2022-10-07 18:28:11 -07:00
Trenton Holmes
d13ca98223
Enables some basic live testing against a tika server with actual sample documents to catch some more errors mocking won't catch
2022-10-07 18:06:06 -07:00
shamoon
962d0bc57b
Merge pull request #1692 from paperless-ngx/feature-frontend-update-checking
...
Feature: frontend update checking settings
2022-10-05 13:46:32 -07:00
Michael Shamoon
f587e36744
Use text mime type for csv files for browser preview
...
Co-Authored-By: Trenton H <797416+stumpylog@users.noreply.github.com>
Co-Authored-By: bin101 <12427722+bin101@users.noreply.github.com>
2022-10-04 13:01:06 -07:00
Trenton H
8111723130
Fixes migration error if some tasks are defined already
2022-10-04 07:56:40 -07:00
shamoon
60a67fd279
Merge branch 'dev' into feature-use-celery
2022-10-03 18:00:54 -07:00
Trenton H
5ad0d7303d
Fixes up some issues with the migrations and type mismatches
2022-10-03 13:18:25 -07:00
Jens van Almsick
d89443b31d
fix: csv recognition by consumer
...
paperless-ngx detects the file format via the MIME type reported by python-magic, which relies on the output of the file command.
In version 5.39 of the file command (which is shipped with Debian bullseye and, I think, many other non-rolling distributions), a *.csv file is detected as "application/csv" instead of "text/csv" as in newer versions.
2022-10-02 16:09:07 -07:00
Michael Shamoon
5c39f058ad
add id to document duplicate error message
2022-10-02 10:27:45 -07:00
Trenton Holmes
a19fbf919d
When a document is a duplicate, include the title of the existing document in the fail message
2022-10-02 10:27:45 -07:00
Michael Shamoon
4dac5fa02d
Fix python + frontend tests
2022-09-30 18:32:21 -07:00
Michael Shamoon
f0cd10e960
Better migration of update checking settings, offer reload, strip backend_setting from db
2022-09-30 14:03:59 -07:00
Michael Shamoon
4eaeac3610
Frontend update checking settings
2022-09-30 12:30:23 -07:00
Trenton H
16980e0c1d
Changes MariaDB encoding to use utf8mb4
2022-09-29 13:53:44 -07:00
Trenton H
4621c1a83a
Fix logger location tag
...
Co-authored-by: shamoon <4887959+shamoon@users.noreply.github.com>
2022-09-28 11:02:34 -07:00