282 Commits

Author SHA1 Message Date
Trenton H
37ae30ccda Forces JSON files to be written as UTF-8, and disables the ensure ASCII option which escapes non-ASCII chars (#4574) 2023-11-13 14:18:21 -08:00
Trenton H
efe779b634 Chore: Cleanup command arguments and standardize process count handling (#4541)
Cleans up some command help text and adds more control over process count for command with a Pool
2023-11-09 11:46:37 -08:00
Trenton H
7769b8968c Fixes the returning of a set, change to a list instead (#4442) 2023-10-25 16:18:13 -07:00
Trenton H
7291f11be7 Ensures all old connections are closed in certain longer running locations (#4265) 2023-09-25 12:52:22 -07:00
Trenton H
b162d7afc8 Handle when INotify fails to import but the polling is set to 0 still (#4230) 2023-09-20 14:04:42 -07:00
Trenton H
af9de2838b Allow the user the specifiy the zip file name (#4189) 2023-09-15 16:33:28 -07:00
Trenton H
9e333bf91d Cleans up the docs, adds validation of the process count, include the test descriptions 2023-09-12 08:17:12 -07:00
Trenton H
e82aaa7d85 Adds multiprocessing to the fuzzy matching for a speedup 2023-09-12 08:17:12 -07:00
shamoon
060966aa22 Add progress bar to document_fuzzy_match 2023-09-12 08:17:12 -07:00
Trenton Holmes
e4bf3c6ac4 Implements a new command for fuzzy matching document content and reporting potential duplicates 2023-09-12 08:17:12 -07:00
Trenton Holmes
34b80a4d8e Removes support for Python 3.8 and lower from the code base 2023-09-10 11:42:59 -07:00
Trenton H
ac8f5eaa7d Unlocks all things which were locked due to ARMv7, etc 2023-09-08 11:21:43 -07:00
Kamil Kosek
addc116f55 Enhancement: add --id-range for document_retagger (#4080)
---------

Co-authored-by: Trenton H <797416+stumpylog@users.noreply.github.com>
2023-09-08 17:33:24 +00:00
Trenton H
e160580c8b Fixes issues with copy2 or copystat and SELinux see #3665 2023-07-22 06:27:49 -07:00
shamoon
6e9be0ba9b Include global and object-level permissions in export / import
adds test for transaction
2023-06-23 23:33:36 -07:00
shamoon
2b9d30cb68 Exclude consumer & AnonymousUser users from export manifest 2023-05-30 20:51:25 -07:00
Trenton Holmes
e34b8c4b1f Replace usages of os.rename with shutil.move to properly handle cases where the source and dest arent't on the same filesystem 2023-05-01 07:28:52 -07:00
Trenton H
aabcc9a1c4 Upgrades black to v23, upgrades ruff 2023-04-26 09:35:27 -07:00
Trenton H
30655f1b73 Fixes ruff not running isort against the codebase 2023-04-26 09:35:27 -07:00
Trenton H
98811df439 Make the importer a little more robust against some types of errors 2023-04-26 07:08:50 -07:00
Trenton H
d2c02b9102 Configures ruff as the one stop linter and resolves warnings it raised 2023-04-01 17:03:52 -07:00
Trenton H
36a6df0bae Creates a data model for the document consumption, allowing stronger typing of arguments and setting of some information about the file only once 2023-04-01 11:05:34 -07:00
Trenton H
8d0e65d369 Limit the number of threads waiting for files to be ready during polling 2023-03-21 07:46:57 -07:00
shamoon
5108bbc192 Rename comments --> notes 2023-03-18 13:59:17 -07:00
Trenton H
67015a8eef Be sure the scratch directory exists before using it as temporary directory 2023-03-01 07:13:31 -08:00
Trenton H
a9c77216b2 Updates ignore path filtering so files in a folder in an ignored folder will be ignored correctly 2023-02-16 09:05:11 -08:00
Trenton H
342713d6d2 Moves the mktime call into the if block where it is used, preventing exceptions during rare cases 2023-02-02 07:25:32 -08:00
Matthieu Helleboid
23157b8050 replace --use-filename-prefix with --use-folder-prefix 2023-01-24 11:06:49 -08:00
Matthieu Helleboid
b9807c96ae sort exporter option by alphabetical order 2023-01-24 11:06:49 -08:00
Matthieu Helleboid
ec02009c91 use pathlib glob 2023-01-24 11:06:49 -08:00
Matthieu Helleboid
420ec2187d add document comments to dedicated manifest file when using 'split-manifest' 2023-01-24 11:06:49 -08:00
Matthieu Helleboid
343bd7df36 Fix split_manifest default value 2023-01-24 11:06:49 -08:00
Matthieu Helleboid
7f287bb311 fix json serialization bug after migration after to Pathlib) 2023-01-24 11:06:49 -08:00
Matthieu Helleboid
8d34fbc756 fix bug on administration exporter when using -d, --delete option 2023-01-24 11:06:49 -08:00
Matthieu Helleboid
b947f63fd9 add split-manifest option to administration exporter 2023-01-24 11:06:49 -08:00
Matthieu Helleboid
44d8bd3151 add prefix option to administration exporter 2023-01-24 11:06:49 -08:00
Matthieu Helleboid
42d4550d9c add no-archive and no-thumbnail options to administration exporter and importer 2023-01-24 11:06:49 -08:00
Trenton H
873522fd82 Updates the exporter to use pathlib and add a few more tests for coverage 2023-01-14 06:33:12 -08:00
Trenton Holmes
a79be46b81 Adds option to allow a user to export directory to a zipfile 2022-12-04 16:38:25 -08:00
Michael Shamoon
1268daf6fe Celery JSON serializable object for override_tag_ids 2022-11-07 07:41:23 -08:00
Trenton Holmes
09287701ae Transitions the backend to celery and celery beat 2022-09-26 11:25:34 -07:00
Michael Shamoon
24da6ce84e Include storage path in exporter + tests 2022-09-11 07:39:35 -07:00
Quinn Casey
75991f4268 Merge pull request #1451 from paperless-ngx/feature-better-redo-ocr
Feature: Even better re-do of OCR
2022-08-25 17:01:54 -07:00
shamoon
4f6ad5369b Merge pull request #1375 from tim-vogel/add_comments
Feature: document comments
2022-08-25 11:48:31 -07:00
shamoon
08e143547b Merge pull request #1421 from paperless-ngx/feature-consumer-event-driven
Feature: Event driven consumer
2022-08-25 08:16:47 -07:00
Michael Shamoon
680055474e Basic verification of Comment export & exporter comment tuple fix
From oprhaned commits b1855a4b7a
53f21574fd

Co-Authored-By: Trenton Holmes <797416+stumpylog@users.noreply.github.com>
2022-08-24 14:24:10 -07:00
Michael Shamoon
b0ae307add python code style 2022-08-23 19:20:08 -07:00
tim-vogel
278e9c12e1 add comment function 2022-08-23 19:19:21 -07:00
Trenton Holmes
059242b7dc Implements a better re-do of OCR by making the document archiver function common. Actually creates updated file now 2022-08-21 18:20:59 -07:00
Trenton Holmes
b6060ff990 Adds the storage paths to the re-tagger command 2022-08-21 18:01:19 -07:00