1321 Commits

Author SHA1 Message Date
Trenton H
189d02dfe6 Always use pikepdf, then pdf2image if needed to check for barcodes instead of requiring/allowing configuration 2022-11-09 13:01:39 -08:00
Michael Shamoon
1268daf6fe Celery JSON serializable object for override_tag_ids 2022-11-07 07:41:23 -08:00
Trenton H
cce0d363e4 Refresh the instance from the database before doing any file handling with it 2022-11-03 11:32:27 -07:00
Trenton Holmes
59ea37f09c No need for an extra import, the object is smart already 2022-11-01 08:44:30 -07:00
Trenton Holmes
f8c3f12146 Adds more options for the filename formatting 2022-11-01 08:44:30 -07:00
Max Bachmann
4a061c38d7 directly use rapidfuzz 2022-10-31 13:17:10 -07:00
Trenton H
1e1f0347fa More smoothly handle the case of a password protected PDF for barcodes 2022-10-24 13:16:14 -07:00
Trenton H
6d2851c693 Allows using pdf2image instead of pikepdf if desired 2022-10-24 09:58:34 -07:00
Trenton H
20b7287dc2 Connects up the celery signals to support pending, started and success/failure, without relying on django-celery-results 2022-10-24 09:10:10 -07:00
Trenton Holmes
ddef90d96e Adds specific handling for CCITT Group 4, which pikepdf decodes, but not correctly 2022-10-11 13:51:14 -07:00
Trenton H
c888b3dfd3 In case pikepdf fails to convert an image to a PIL image, fall back to converting pages to PIL images 2022-10-11 13:51:13 -07:00
Trenton H
0c08b16402 Catch the new error raised by redis when it can't find the broker and stub out the call for testing 2022-10-10 14:21:42 -07:00
Trenton H
4994df2e3c Fixes usage of a deprecated logger method 2022-10-10 14:20:19 -07:00
Trenton H
2d71415ede Allows disabling NLTK, adds it as a consideration for low power devices 2022-10-10 08:58:23 -07:00
Trenton Holmes
a78d44ec5f Changes the NLTK language to be based on the Tesseract OCR language, with fallback to the default processing 2022-10-10 08:58:23 -07:00
Trenton H
0bc13c2a72 Allows configuration of the NLTK processing language 2022-10-10 08:58:23 -07:00
Trenton Holmes
70b1988a55 Fixes the download and usage of the downloaded data 2022-10-10 08:58:23 -07:00
Trenton Holmes
3c12f13df2 Missed one mock 2022-10-10 08:58:23 -07:00
Trenton Holmes
d334eec321 Mock out the nltk portions so the data doesn't need to be downloaded 2022-10-10 08:58:23 -07:00
Trenton Holmes
66884ea035 Updates the pre-processing of document content to be much more robust, with tokenization, stemming and stop word removal 2022-10-10 08:58:23 -07:00
shamoon
3b1f5668f5 Merge pull request #1648 from paperless-ngx/feature-use-celery
Feature: Transition to celery for background tasks
2022-10-10 00:07:55 -07:00
Trenton Holmes
397179ebe9 Fixes is_relative_to not being available for 3.8 2022-10-09 17:43:58 -07:00
Trenton Holmes
3b5a7fbd65 If the original file containing a barcode was in the temporary scratch dir, move the split files to consume dir 2022-10-09 17:43:58 -07:00
shamoon
962d0bc57b Merge pull request #1692 from paperless-ngx/feature-frontend-update-checking
Feature: frontend update checking settings
2022-10-05 13:46:32 -07:00
Michael Shamoon
f587e36744 Use text mime type for csv files for browser preview
Co-Authored-By: Trenton H <797416+stumpylog@users.noreply.github.com>
Co-Authored-By: bin101 <12427722+bin101@users.noreply.github.com>
2022-10-04 13:01:06 -07:00
Trenton H
8111723130 Fixes migration error if some tasks are defined already 2022-10-04 07:56:40 -07:00
shamoon
60a67fd279 Merge branch 'dev' into feature-use-celery 2022-10-03 18:00:54 -07:00
Trenton H
5ad0d7303d Fixes up some issues with the migrations and type mismatches 2022-10-03 13:18:25 -07:00
Michael Shamoon
5c39f058ad add id to document duplicate error message 2022-10-02 10:27:45 -07:00
Trenton Holmes
a19fbf919d When a document is a duplicate, include the title of the existing document in the fail message 2022-10-02 10:27:45 -07:00
Michael Shamoon
4dac5fa02d Fix python + frontend tests 2022-09-30 18:32:21 -07:00
Michael Shamoon
f0cd10e960 Better migration of update checking settings, offer reload, strip backend_setting from db 2022-09-30 14:03:59 -07:00
Michael Shamoon
4eaeac3610 Frontend update checking settings 2022-09-30 12:30:23 -07:00
Trenton H
4621c1a83a Fix logger location tag
Co-authored-by: shamoon <4887959+shamoon@users.noreply.github.com>
2022-09-28 11:02:34 -07:00
Trenton H
62872bce9d Updates how task_args and task_kwargs are parsed, adds testing to cover everything I can think of 2022-09-28 10:40:55 -07:00
Michael Shamoon
aa7b2690cb Add related_document and direct link from task UI 2022-09-27 20:50:26 -07:00
Michael Shamoon
8c6d67bddf Filter out old migrated tasks 2022-09-27 19:41:23 -07:00
Michael Shamoon
e60a1e55f4 Add document name & error result parsing to PaperlessTask serializer 2022-09-27 19:40:24 -07:00
Trenton H
a96ac56c2b Ensures all existing one to one fields are nulled before altering the field 2022-09-27 14:17:42 -07:00
Trenton H
fe697f009c Fixes migration order back to the right way 2022-09-27 13:55:31 -07:00
Trenton H
e8d237ac7c Finalizes what the PaperlessTask will look like to the frontend 2022-09-27 12:44:01 -07:00
Trenton Holmes
09287701ae Transitions the backend to celery and celery beat 2022-09-26 11:25:34 -07:00
Paperless-ngx Translation Bot [bot]
8571edab6a New Crowdin updates (#1607)
* bugfix: increase delay

partially reverts 86358d556162e4c6112238821937f7856749f157
re-implements 4fbabe43ea12811864e9676b04d82a82b38e799d

Signed-off-by: Florian Brandes <florian.brandes@posteo.de>

* New translations messages.xlf (Finnish) [ci skip]

Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
Co-authored-by: Florian Brandes <florian.brandes@posteo.de>
2022-09-16 15:45:17 -07:00
Trenton H
13465fcfda Fixes grammar in comment
Co-authored-by: Florian <florian.brandes@posteo.de>
2022-09-16 09:08:16 -07:00
Trenton Holmes
b21f64de8a Updates how barcodes are detected, using pikepdf images, instead of converting each page to an image 2022-09-16 09:08:16 -07:00
Trenton Holmes
33a4a273a3 Fixes the separation of files by barcode, during the case where 2 barcodes appear back to back 2022-09-14 14:00:37 -07:00
Michael Shamoon
24da6ce84e Include storage path in exporter + tests 2022-09-11 07:39:35 -07:00
Michael Shamoon
a1f1a0bf60 Fix 1023, 1024 migrations, add savedviewfilterrule migration 2022-08-26 10:02:48 -07:00
Michael Shamoon
2a3c424476 Add missing filter rule types to SavedViewFilterRule model
And add note to filter-rule-type.ts
2022-08-26 10:02:48 -07:00
Quinn Casey
75991f4268 Merge pull request #1451 from paperless-ngx/feature-better-redo-ocr
Feature: Even better re-do of OCR
2022-08-25 17:01:54 -07:00