jonaswinkler
1d073d2cfd
a couple fixes and more supported image files
2020-12-02 17:39:49 +01:00
jonaswinkler
daf2e345c4
code style
2020-12-01 14:33:37 +01:00
jonaswinkler
f08d494f1b
filename handling for archive files.
2020-11-30 21:38:42 +01:00
jonaswinkler
ddb3ef49f6
Merge branch 'dev' into feature-ocrmypdf
2020-11-29 18:37:38 +01:00
jonaswinkler
0e9d88ef7d
test cases for #67
2020-11-29 15:47:56 +01:00
jonaswinkler
56d4843a96
Merge branch 'dev' into feature-ocrmypdf
2020-11-29 01:35:37 +01:00
jonaswinkler
906fe4a68d
fixes #61
2020-11-28 19:28:46 +01:00
Jonas Winkler
17b62b61fa
add support for archive files.
2020-11-25 14:47:17 +01:00
Jonas Winkler
afc3753e58
code cleanup
2020-11-21 14:03:45 +01:00
Jonas Winkler
8b8a2af053
fixed the file handling implementation. The feature is cool, but the original implementation had so many small flaws it wasn't even funny.
2020-11-11 14:21:33 +01:00
Jonas Winkler
ec1f65a355
I removed the model save/delete hooks for index updates since they were causing too much trouble with migrations
2020-11-08 11:24:57 +01:00
Jonas Winkler
5686f76ef2
this was not required since saving a document updates the index anyway
2020-11-03 13:51:49 +01:00
Jonas Winkler
9a4ff3f807
replaced usages of .id with .pk, fixed filename issue in exporter
2020-11-03 12:37:37 +01:00
Jonas Winkler
048ff1de95
fixed unnecessary assignation of correspondents / types to documents
2020-11-02 19:27:24 +01:00
Jonas Winkler
a89773ad71
removed unused code, small fixes
2020-11-02 18:20:04 +01:00
Jonas Winkler
def3a85858
reworked most of the tesseract parser, better logging
2020-11-02 15:40:44 +01:00
Jonas Winkler
dd16b7262e
unified document matching, legacy and automatching work alongside now
2020-10-28 11:45:11 +01:00
Jonas Winkler
368b6d0512
removed some obsolete fields from the document model
2020-10-28 01:41:18 +01:00
Jonas Winkler
93d963ed4e
added
...
- document index
- api access for thumbnails/downloads
- more api filters
updated
- pipfile
removed
- filename handling
- legacy thumb/download access
- obsolete admin gui settings (per page items, FY, inline view)
2020-10-25 23:03:02 +01:00
Jonas Winkler
efc7bf1d23
Code style adjustments
2018-09-25 16:09:33 +02:00
Jonas Winkler
35ea0f2add
Merge branch 'machine-learning' into dev
2018-09-11 14:36:21 +02:00
Jonas Winkler
8a16b62773
The classifier works with ids now, not names. Minor changes.
2018-09-11 14:30:18 +02:00
Jonas Winkler
1c8576cfb9
mode change
2018-09-06 12:00:01 +02:00
Jonas Winkler
9d4155a907
removed matching model fields, automatic classifier reloading, added autmatic_classification field to matching model
2018-09-04 18:40:26 +02:00
Jonas Winkler
c50c517928
Implemented the classifier model, including automatic tagging of new documents
2018-09-04 14:39:55 +02:00
Jonas Winkler
11b4e56790
Added document type
2018-08-24 13:45:15 +02:00
Jonas Winkler
c03cfb176c
inbox tags, archive tags, archive serial number for documents
2018-07-06 13:25:02 +02:00
CkuT
0c28a5329c
Fix LogEntry user when PAPERLESS_DISABLE_LOGIN is set to true
2018-04-17 21:03:18 +02:00
Erik Arvstedt
363b8ff9aa
Fix runtime warning when adding log entries
...
LogEntry.action_time expects a Django timezone object instead of a builtin datetime.
This fixes a runtime warning of the following kind:
RuntimeWarning: DateTimeField LogEntry.action_time received a naive datetime (2018-03-28 20:53:01.714173) while time zone support is active.
2018-03-30 00:15:52 +02:00
CkuT
53214b7993
Use datetime.now() instead of document.created for LogEntry action_time
2018-03-13 19:09:48 +01:00
CkuT
feab3e5dfc
Add LogEntry after document consumption
...
See #319
2018-03-11 17:09:43 +01:00
Daniel Quinn
d2c283582b
feat: refactor for pluggable consumers
...
I've broken out the OCR-specific code from the consumers and dumped it
all into its own app, `paperless_tesseract`. This new app should serve
as a sample of how to create one's own consumer for different file
types.
Documentation for how to do this isn't ready yet, but for the impatient:
* Create a new app
* containing a `parsers.py` for your parser modelled after
`paperless_tesseract.parsers.RasterisedDocumentParser`
* containing a `signals.py` with a handler moddelled after
`paperless_tesseract.signals.ConsumerDeclaration`
* connect the signal handler to
`documents.signals.document_consumer_declaration` in
`your_app.apps`
* Install the app into Paperless by declaring
`PAPERLESS_INSTALLED_APPS=your_app`. Additional apps should be
separated with commas.
* Restart the consumer
2017-03-25 15:10:25 +00:00
Daniel Quinn
0a0ff8da90
Fixed an ugly bug that broke all deletions
2016-08-23 15:54:09 +01:00
Daniel Quinn
3bbcd562fc
Account for thumbnail deletion as well
2016-08-20 14:03:42 +01:00
Daniel Quinn
f5daded930
Fix for #131 : delete files on document.delete
2016-08-16 19:13:37 +01:00
Lenz Weber
e7566d2b1c
style changes, variable renames
...
* PEP8 conformity
* rename run_post_consume_external_script to run_post_consume_script
* rename run_pre_consume_external_script to run_pre_consume_script
* change order of declaration and use from post...pre to pre...post
2016-06-24 16:53:55 +02:00
Lenz Weber
c728b1dd21
add pre-consume hook
...
a script hook can be defined in /etc/paperless.conf as
PAPERLESS_PRE_CONSUME_SCRIPT
2016-06-23 21:57:17 +02:00
Daniel Quinn
e0ce2a7656
The consumer hook shouldn't contain the passphrase. That was a bad idea
2016-04-03 19:35:17 +01:00
Daniel Quinn
96b8dd6d56
Added the passphrase to the consume hook and fixed the docs for it
2016-04-03 18:47:49 +01:00
Daniel Quinn
cb2df58b27
Everything appears to be working
2016-03-28 19:47:11 +01:00
Daniel Quinn
b4e648e1e3
Test All The Things
2016-03-28 14:16:26 +01:00
Daniel Quinn
b92e007e15
Removed log components and introduced signals for tags & correspondents
2016-03-28 11:11:15 +01:00