34 Commits

Author SHA1 Message Date
Trenton H
61d0459e3c Feature: Switches to a new client to handle communication with Gotenberg (#4391)
Switches to a new client to handle communication with Gotenberg for merging and generating PDFs
2023-10-20 00:27:29 +00:00
Trenton Holmes
34b80a4d8e Removes support for Python 3.8 and lower from the code base 2023-09-10 11:42:59 -07:00
Trenton H
ac8f5eaa7d Unlocks all things which were locked due to ARMv7, etc 2023-09-08 11:21:43 -07:00
Trenton Holmes
0ddb62943c Sets the http timeouts equal to the task timeout, so it's either done or really done 2023-08-23 18:40:22 -07:00
Trenton H
97d9edda96 Updates tika client library and handles the changes to it 2023-06-26 10:41:05 -06:00
Trenton H
fe9874a96a Fixes generation of thumbnails when the archive file hasn't already been created 2023-06-26 10:36:50 -06:00
Trenton H
7a63bcc817 Updates the httpx timeout to be 30s for all operations 2023-06-19 08:59:51 -07:00
Trenton Holmes
48ab961c68 Adds better error handling/checking around getting content of a document via Tika
Signed-off-by: Trenton Holmes <797416+stumpylog@users.noreply.github.com>
2023-06-18 08:39:17 -07:00
Trenton H
1b3492a01f Rewrites the email parsing to be more clear and concise.
Adds testing to use httpx mocked responses to stand in as a server even offline
2023-06-06 09:05:26 -07:00
Trenton H
36f09c4974 Swaps out tika and replaces requests with httpx 2023-06-06 09:05:26 -07:00
Trenton H
bad8d304cb Improves the logging mixin and allows it to be typed better 2023-05-23 17:16:39 -07:00
Trenton H
aabcc9a1c4 Upgrades black to v23, upgrades ruff 2023-04-26 09:35:27 -07:00
Trenton H
30655f1b73 Fixes ruff not running isort against the codebase 2023-04-26 09:35:27 -07:00
Trenton H
d2c02b9102 Configures ruff as the one stop linter and resolves warnings it raised 2023-04-01 17:03:52 -07:00
Trenton H
dc95cc3cd4 Adds setting to Gotenberg API call for outputting the correct PDF/A format 2023-01-27 11:05:23 -08:00
Trenton Holmes
43e51cd7a9 Check for and handle the mail's date being naive 2022-12-31 12:35:42 -08:00
phail
daf5a72e82 change order of elements in parsed Texts 2022-11-20 20:24:36 +01:00
phail
d50e9e589a fix filenames 2022-11-20 19:53:57 +01:00
Trenton Holmes
7ab45080b7 Fixes one more place which used manual size formatting 2022-11-20 09:15:06 -08:00
Trenton Holmes
f02a4fa7cf Use Django templating engine 2022-11-20 09:13:16 -08:00
phail
1fbde90833 eml parsing requires tika 2022-11-20 14:22:30 +01:00
phail
1aba154fb7 optimize regex 2022-11-20 12:48:03 +01:00
phail
3779adb30e use html.escape instead of some self-built functions 2022-11-20 12:36:49 +01:00
phail
ee4309a8bd remove log mocking, replace pytest raises, use humanfriendly 2022-11-20 12:06:35 +01:00
phail
b151cb7293 update variable names 2022-10-23 21:39:15 +02:00
phail
9d6b725fa1 add tests for mail_to_html and generate_pdf_from_mail 2022-10-23 17:18:10 +02:00
phail
6854896708 test for broken eml, add test_generate_pdf 2022-10-22 02:25:23 +02:00
phail
3d58129666 add unittest for transform_inline_html 2022-10-18 23:48:07 +02:00
phail
cef1a4f8b9 Add unittest for tika_parse() 2022-10-15 13:13:29 +02:00
phail
037c8a4950 add attachment size to pdf 2022-05-05 00:20:32 +02:00
phail
b14153b6cf fix consumption of mails without html
split pdf generation functions
2022-05-04 23:42:59 +02:00
phail
9c5f824bb3 add parsing of inline attachments
remove insecure chromium option
fix html parsing
2022-05-03 23:21:33 +02:00
phail
404026dafb remove .eml parser from tika 2022-05-03 18:02:08 +02:00
phail
79579d6239 add eml parser to paperless_mail 2022-04-29 22:58:11 +02:00