1345 Commits

Author SHA1 Message Date
Michael Shamoon
ebcf75b1b4 comment search highlighting 2023-01-04 19:06:51 -08:00
Michael Shamoon
7d6dc5c954 Support comment search 2023-01-04 19:06:06 -08:00
Trenton Holmes
474433021e Adds testing coverage and fixing up post API tests 2022-12-30 14:00:40 -08:00
Trenton Holmes
de6cde2eaf Updates the post view to return the async task id. Adds query parameter to the tasks endpoint to get single task status 2022-12-30 13:43:49 -08:00
Michael Shamoon
c0c36aeae3 support tags__id__none for advanced search 2022-12-17 19:49:17 -08:00
shamoon
24029f27b4 Merge pull request #2004 from paperless-ngx/feature-export-to-zip
Feature: Adds option to allow a user to export directly to a zipfile
2022-12-16 09:12:38 -08:00
Trenton H
58d6ad96b7 Adds a basic admin view for tasks 2022-12-15 11:31:50 -08:00
Trenton Holmes
eeb96a3c94 Notes a TODO for a later library release to remove a workaround 2022-12-12 15:37:01 -08:00
Trenton H
5030a5212e Switches task serialization over to pickle format 2022-12-12 15:37:01 -08:00
shamoon
cf8b949325 Merge pull request #2028 from paperless-ngx/feature-truncate-content
Feature: speed up frontend by truncating content
2022-12-09 09:20:06 -08:00
Trenton Holmes
447a892f17 Documents the test conditions 2022-12-08 18:32:14 -08:00
Trenton Holmes
9a89fe455c Adds an optional API field to follow the filenaming format when creating a bulk download zip 2022-12-08 18:32:14 -08:00
Trenton Holmes
793385114d Adds documentation 2022-12-04 16:53:28 -08:00
Trenton Holmes
a79be46b81 Adds option to allow a user to export directory to a zipfile 2022-12-04 16:38:25 -08:00
Michael Shamoon
7c55144b31 Fix docs references 2022-12-03 02:04:30 -08:00
Trenton H
8638b893b3 Don't silence an exception when trying to handle file naming 2022-11-28 13:44:17 -08:00
Michael Shamoon
6d0d738b24 option to truncate content on /documents endpoint 2022-11-27 21:19:23 -08:00
Trenton H
23ee387d96 If override_date is provided, coerce it into a datetime 2022-11-22 10:11:27 -08:00
shamoon
779fded0e3 Merge pull request #1998 from paperless-ngx/fix/1993-date-overflow
Bugfix: Don't allow exceptions during date parsing to fail consume
2022-11-17 14:31:46 -08:00
Trenton H
cdfcbff529 Don't allow an exception when trying to parse a date cause complete failure 2022-11-17 13:37:37 -08:00
Trenton H
32355e859f Don't do decoding work if not needed 2022-11-11 10:09:56 -08:00
Trenton H
2228cc292d Adds further testing to cover scripts with non-zero exit codes 2022-11-11 08:58:49 -08:00
Trenton H
7a0763a128 Fixes stderr appearing to have content when it doesn't 2022-11-10 19:01:54 -08:00
Trenton Holmes
81c98103a8 Captures the stdout and stderr of the pre/post scripts into the log 2022-11-10 17:26:47 -08:00
Trenton H
189d02dfe6 Always use pikepdf, then pdf2image if needed to check for barcodes instead of requiring/allowing configuration 2022-11-09 13:01:39 -08:00
Michael Shamoon
1268daf6fe Celery JSON serializable object for override_tag_ids 2022-11-07 07:41:23 -08:00
Trenton H
cce0d363e4 Refresh the instance from the database before doing any file handling with it 2022-11-03 11:32:27 -07:00
Trenton Holmes
59ea37f09c No need for an extra import, the object is smart already 2022-11-01 08:44:30 -07:00
Trenton Holmes
f8c3f12146 Adds more options for the filename formatting 2022-11-01 08:44:30 -07:00
Max Bachmann
4a061c38d7 directly use rapidfuzz 2022-10-31 13:17:10 -07:00
Trenton H
1e1f0347fa More smoothly handle the case of a password protected PDF for barcodes 2022-10-24 13:16:14 -07:00
Trenton H
6d2851c693 Allows using pdf2image instead of pikepdf if desired 2022-10-24 09:58:34 -07:00
Trenton H
20b7287dc2 Connects up the celery signals to support pending, started and success/failure, without relying on django-celery-results 2022-10-24 09:10:10 -07:00
Trenton Holmes
ddef90d96e Adds specific handling for CCITT Group 4, which pikepdf decodes, but not correctly 2022-10-11 13:51:14 -07:00
Trenton H
c888b3dfd3 In case pikepdf fails to convert an image to a PIL image, fall back to converting pages to PIL images 2022-10-11 13:51:13 -07:00
Trenton H
0c08b16402 Catch the new error raised by redis when it can't find the broker and stub out the call for testing 2022-10-10 14:21:42 -07:00
Trenton H
4994df2e3c Fixes usage of a depracated logger method 2022-10-10 14:20:19 -07:00
Trenton H
2d71415ede Allows disabling NLTK, adds it as a consideration for low power devices 2022-10-10 08:58:23 -07:00
Trenton Holmes
a78d44ec5f Changes the NLTK language to be based on the Tesseract OCR language, with fallback to the default processing 2022-10-10 08:58:23 -07:00
Trenton H
0bc13c2a72 Allows configuration of the NLTK processing language 2022-10-10 08:58:23 -07:00
Trenton Holmes
70b1988a55 Fixes the download and usage of the downloaded data 2022-10-10 08:58:23 -07:00
Trenton Holmes
3c12f13df2 Missed one mock 2022-10-10 08:58:23 -07:00
Trenton Holmes
d334eec321 Mock out the nltk portions so the data doesn't need to be downloaded 2022-10-10 08:58:23 -07:00
Trenton Holmes
66884ea035 Updates the pre-processing of document content to be much more robust, with tokenization, stemming and stop word removal 2022-10-10 08:58:23 -07:00
shamoon
3b1f5668f5 Merge pull request #1648 from paperless-ngx/feature-use-celery
Feature: Transition to celery for background tasks
2022-10-10 00:07:55 -07:00
Trenton Holmes
397179ebe9 Fixes is_relative_to not being availible for 3.8 2022-10-09 17:43:58 -07:00
Trenton Holmes
3b5a7fbd65 If the original file containing a barcode was in the temporary scratch dir, move the split files to consume dir 2022-10-09 17:43:58 -07:00
shamoon
962d0bc57b Merge pull request #1692 from paperless-ngx/feature-frontend-update-checking
Feature: frontend update checking settings
2022-10-05 13:46:32 -07:00
Michael Shamoon
f587e36744 Use text mime type for csv files for browser preview
Co-Authored-By: Trenton H <797416+stumpylog@users.noreply.github.com>
Co-Authored-By: bin101 <12427722+bin101@users.noreply.github.com>
2022-10-04 13:01:06 -07:00
Trenton H
8111723130 Fixes migration error if some tasks are defined already 2022-10-04 07:56:40 -07:00