321 Commits

Author SHA1 Message Date
Daniel Quinn
fad466477b More verbose error logging 2016-03-03 18:18:48 +00:00
Daniel Quinn
55dcbcc47f Forgot a slash 2016-03-03 18:18:38 +00:00
Daniel Quinn
b8be20b565 Preparing for a proper UI 2016-03-03 18:09:10 +00:00
Daniel Quinn
0aead1fbe6 #68: Using dotenv for a proper unix config file 2016-03-03 17:59:27 +00:00
Daniel Quinn
857c7ac654 #44: Harmonise environment variables with constant names 2016-03-01 22:39:40 +00:00
Daniel Quinn
26c3781350 #44: Harmonise environment variables with constant names 2016-03-01 22:37:42 +00:00
Daniel Quinn
7d1aa1175f pep8 2016-03-01 19:03:28 +00:00
Daniel Quinn
26fc27da9b Setting appropriate permissions 2016-03-01 18:57:12 +00:00
Daniel Quinn
85f5963851 Fixt capitalisation 2016-02-28 15:02:18 +00:00
Daniel Quinn
439b60ce5c Merged new logging system 2016-02-28 15:01:19 +00:00
Daniel Quinn
9379e95446 Added a test for the new by_group() feature 2016-02-28 00:53:18 +00:00
Daniel Quinn
8687892332 Don't print to standard out during a test 2016-02-28 00:52:44 +00:00
Daniel Quinn
5a8e75112f Added a custom manager for grouped logs 2016-02-28 00:41:03 +00:00
Daniel Quinn
d686aba9ae Reset the group id for every pull 2016-02-28 00:40:08 +00:00
Daniel Quinn
631aa99d92 No need to pass verbosity around anymore 2016-02-28 00:39:40 +00:00
Daniel Quinn
a4d89ed124 Fixt the test to ignore verbosity 2016-02-27 20:50:48 +00:00
Daniel Quinn
1bfd5bcc2f Merge branch 'master' of https://github.com/danielquinn/paperless 2016-02-27 20:19:32 +00:00
Daniel Quinn
51173d80cf License clarification 2016-02-27 20:19:09 +00:00
Daniel Quinn
2fe9b0cbc1 New logging appears to work 2016-02-27 20:18:50 +00:00
Daniel Quinn
df1741e1fa Added a time to the logger admin 2016-02-26 11:21:14 +00:00
Daniel Quinn
1aecb1e63a Compensate for case and format of jpg vs. jpeg 2016-02-23 20:15:13 +00:00
Daniel Quinn
312cb523d8 /tmp is probably better than /tmp/paperless/consume 2016-02-21 02:30:39 +00:00
Daniel Quinn
3a7923e32d Moved pyocr.get_available_tools() into a method 2016-02-21 02:24:05 +00:00
Daniel Quinn
6f7169d2d6 Travis integration: take 3 2016-02-21 01:46:49 +00:00
Daniel Quinn
55a7dc2444 pep8 2016-02-21 01:43:48 +00:00
Daniel Quinn
c7787bc076 Let's see if I can get Travis CI working on the first try 2016-02-21 01:37:57 +00:00
Daniel Quinn
0d46643026 Version bump 2016-02-21 01:24:30 +00:00
Daniel Quinn
17d3a44952 A crude API is in place 2016-02-21 00:55:38 +00:00
Daniel Quinn
809fb8fa1f Moved the default GNUPG home to /tmp for tox-friendliness 2016-02-21 00:29:59 +00:00
Daniel Quinn
440614eddc Got tox working 2016-02-21 00:29:21 +00:00
Daniel Quinn
422ae9303a pep8 2016-02-21 00:14:50 +00:00
Daniel Quinn
a5124cade6 Merge branch 'master' into feature/api 2016-02-20 22:55:42 +00:00
Daniel Quinn
51b19f4c19 Issue #57 2016-02-20 22:30:01 +00:00
Pit Kleyersburg
c45f951ca0 Ignore error if orientation detection fails
Fixes an additional issue that came up in #48.
2016-02-19 09:52:32 +01:00
Daniel Quinn
ec88ea73f6 #48: make the tag matching smarter 2016-02-19 00:45:02 +00:00
Pit Kleyersburg
724afa59c7 Add Dockerfile for application and documentation
This commit adds a `Dockerfile` to the root of the project, accompanied
by a `docker-compose.yml.example` for simplified deployment. The
`Dockerfile` is agnostic to whether it runs the webserver, the
consumer, or a one-off command (e.g. creation of a
superuser, migration of the database, document export, ...).

The container's entrypoint is the `scripts/docker-entrypoint.sh` script.
This script verifies that the required permissions are set, remaps the
default user's and/or group's ID if required, and installs additional
languages if the user wishes.

After initialization, it analyzes the command the user supplied:

  - If the command starts with a slash, it is expected that the user
    wants to execute a binary file and the command will be executed
    without further intervention. (`exec` replaces the running shell
    script with the command, which avoids process-reaping issues.)

  - If the command does not start with a slash, the command will be
    passed directly to the `manage.py` script without further
    modification. (Again using `exec`.)

The default command is set to `--help`.

If the user wants to execute a command that is not meant for `manage.py`
but doesn't start with a slash, the Docker `--entrypoint` parameter can
be used to circumvent the mechanics of `docker-entrypoint.sh`.
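
The dispatch rule described above can be sketched in Python. This is only an illustration: the real entrypoint is a shell script whose contents are not shown here, and the `build_argv` helper and the `MANAGE_PY` path are hypothetical names introduced for this example.

```python
# Hypothetical location of manage.py inside the container (an assumption).
MANAGE_PY = "/usr/src/paperless/src/manage.py"


def build_argv(command, manage_py=MANAGE_PY):
    """Return the argv that would be exec'd for a user-supplied command.

    Mirrors the rule in the commit message:
      - no command at all      -> fall back to the default, `--help`
      - first word starts "/"  -> run the command untouched
      - anything else          -> hand every argument to manage.py
    """
    if not command:
        command = ["--help"]
    if command[0].startswith("/"):
        return list(command)
    return [manage_py] + list(command)


if __name__ == "__main__":
    # A real entrypoint would finish with something like
    # os.execvp(argv[0], argv) so that the command replaces this process.
    print(build_argv(["migrate"]))
    print(build_argv(["/bin/sh", "-c", "ls"]))
```

A command such as `ls -l` has no leading slash, so under this rule it would be handed to `manage.py`; that is the case the `--entrypoint` override is meant for.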

Further information can be found in `docs/setup.rst` and in
`docs/migrating.rst`.

For additional convenience, a `Dockerfile` has been added to the `docs/`
directory which allows for easy building and serving of the
documentation. This is documented in `docs/requirements.rst`.
2016-02-18 22:58:32 +01:00
Pit Kleyersburg
c34d57a872 Detect image orientation if the OCR supports it
Fixes issue #47.
2016-02-18 09:37:13 +01:00
Daniel Quinn
1e7ece81ee Fixes #45 2016-02-17 23:07:54 +00:00
Daniel Quinn
eb01bcf98b The Log class needed a __str__() method 2016-02-17 23:06:35 +00:00
Daniel Quinn
1c45ca10d4 Patched sorting 2016-02-17 00:11:57 +00:00
Daniel Quinn
550184cbae Patched sorting 2016-02-17 00:11:46 +00:00
Daniel Quinn
6f95b05287 Support appropriate sorting for long documents 2016-02-17 00:10:05 +00:00
Pit Kleyersburg
46f8f492f5 Safely and non-randomly create scratch directory
Creating the scratch files in `_get_grayscale` with names based on a
random integer is inherently unsafe, because two names can collide. It
should also be unnecessary, given that the files are cleaned up after
the OCR run.

Since we don't know if OCR runs might be parallel in the future, this
commit implements thread-safe and deterministic directory-creation.

Additionally, it fixes the call to `_cleanup` by `consume`. In the
current implementation `_cleanup` will not be called if the last
consumed document failed with an `OCRError`; this commit fixes that.
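
A minimal sketch of the two ideas, assuming a simplified `consume` loop: one collision-free scratch directory per document, and cleanup that runs even when OCR fails. It uses the standard library's `tempfile.mkdtemp`, which creates the directory atomically; whether the commit itself uses that call is not stated here. The `consume` signature, the `ocr` callable and the `paperless-` prefix are hypothetical.

```python
import shutil
import tempfile


class OCRError(Exception):
    """Stand-in for the project's OCR failure exception."""


def consume(documents, ocr, scratch_root=None):
    """Run `ocr(document, scratch_dir)` for each document.

    Each document gets its own scratch directory. mkdtemp() creates it
    atomically, so concurrent runs can never share or collide on a path.
    """
    results = []
    for document in documents:
        scratch = tempfile.mkdtemp(prefix="paperless-", dir=scratch_root)
        try:
            results.append(ocr(document, scratch))
        except OCRError:
            # A failed document must not prevent cleanup or later documents.
            results.append(None)
        finally:
            # Runs on success and on failure, including for the last document.
            shutil.rmtree(scratch, ignore_errors=True)
    return results
```

Putting the cleanup in `finally` is what covers the case named above, where the last consumed document fails.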
2016-02-16 12:15:57 +01:00
Daniel Quinn
cebc44f2c9 API is halfway there 2016-02-16 09:28:34 +00:00
Daniel Quinn
2f0da8ab25 Added download_url to the Document model 2016-02-15 22:38:18 +00:00
Pit Kleyersburg
7b227ffa2f Fix matching if user supplied an empty value 2016-02-14 19:47:05 +01:00
Daniel Quinn
aea4af5d3b Version bump and feature update 2016-02-14 17:18:28 +00:00
Daniel Quinn
a0f4f6c5f2 Fixed merge conflict and did some pep8 2016-02-14 17:13:48 +00:00
Pit Kleyersburg
aeab9a0e81 Detect language only on one page of PDF
Currently the entire document is processed to detect its language. If
the detected language differs from the default, the entire document is
processed again in the new language.

This PR analyzes the middle page for its language and either processes
the remaining pages with the default language if it didn't differ, or
processes all pages for the new guessed language.

The number of processed pages drops from a worst case of `2n` to a
worst case of `n+1`.
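
The page count can be checked with a small sketch. This is not the project's code: `ocr_document`, `ocr_page` and `guess_language` are hypothetical names, and the OCR and language-guessing steps are passed in as plain callables so that the calls can be counted.

```python
def ocr_document(pages, ocr_page, guess_language, default_language="eng"):
    """OCR `pages`, guessing the language from the middle page only.

    Returns (language, texts, ocr_calls). For n pages, ocr_calls is n
    when the guess matches the default and n + 1 when it does not.
    """
    if not pages:
        return default_language, [], 0

    calls = 0

    def run(page, language):
        nonlocal calls
        calls += 1
        return ocr_page(page, language)

    middle = len(pages) // 2
    sample = run(pages[middle], default_language)
    language = guess_language(sample) or default_language

    if language == default_language:
        # Reuse the middle page's text; only the remaining pages are OCR'd.
        texts = [
            sample if index == middle else run(page, default_language)
            for index, page in enumerate(pages)
        ]
    else:
        # The guess differs, so every page is processed in the new language.
        texts = [run(page, language) for page in pages]
    return language, texts, calls
```

Only the middle page is ever processed twice, which is where the single extra call in the worst case comes from.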
2016-02-14 17:55:13 +01:00
Daniel Quinn
7843ea5037 Added and implemented a rudimentary logger 2016-02-14 16:09:52 +00:00