mirror of
https://github.com/paperless-ngx/paperless-ngx.git
synced 2025-04-02 13:45:10 -05:00
322 lines
9.5 KiB
ReStructuredText
322 lines
9.5 KiB
ReStructuredText
|
|
**************
|
|
Administration
|
|
**************
|
|
|
|
|
|
Making backups
|
|
##############
|
|
|
|
.. warning::
|
|
|
|
This section is not updated yet.
|
|
|
|
So you're bored of this whole project, or you want to make a remote backup of
|
|
your files for whatever reason. This is easy to do, simply use the
|
|
:ref:`exporter <utilities-exporter>` to dump your documents and database out
|
|
into an arbitrary directory.
|
|
|
|
|
|
.. _migrating-restoring:
|
|
|
|
Restoring
|
|
=========
|
|
|
|
.. _administration-updating:
|
|
|
|
Updating paperless
|
|
##################
|
|
|
|
.. warning::
|
|
|
|
This section is not updated yet.
|
|
|
|
For the most part, all you have to do to update Paperless is run ``git pull``
|
|
on the directory containing the project files, and then use Django's
|
|
``migrate`` command to execute any database schema updates that might have been
|
|
rolled in as part of the update:
|
|
|
|
.. code-block:: shell-session
|
|
|
|
$ cd /path/to/project
|
|
$ git pull
|
|
$ pip install -r requirements.txt
|
|
$ cd src
|
|
$ ./manage.py migrate
|
|
|
|
Note that it's possible (even likely) that while ``git pull`` may update some
|
|
files, the ``migrate`` step may not update anything. This is totally normal.
|
|
|
|
Additionally, as new features are added, the ability to control those features
|
|
is typically added by way of an environment variable set in ``paperless.conf``.
|
|
You may want to take a look at the ``paperless.conf.example`` file to see if
|
|
there's anything new in there compared to what you've got in ``/etc``.
|
|
|
|
If you are using docker the update process is similar:
|
|
|
|
.. code-block:: shell-session
|
|
|
|
$ cd /path/to/project
|
|
$ git pull
|
|
$ docker build -t paperless .
|
|
$ docker-compose run --rm consumer migrate
|
|
$ docker-compose up -d
|
|
|
|
If ``git pull`` doesn't report any changes, there is no need to continue with
|
|
the remaining steps.
|
|
|
|
This depends on the route you've chosen to run paperless.
|
|
|
|
a. If you are not using docker, update python requirements. Paperless uses
|
|
`Pipenv`_ for managing dependencies:
|
|
|
|
.. code:: bash
|
|
|
|
$ pip install --upgrade pipenv
|
|
$ cd /path/to/paperless
|
|
$ pipenv install
|
|
|
|
This creates a new virtual environment (or uses your existing environment)
|
|
and installs all dependencies into it. Running commands inside the environment
|
|
is done via
|
|
|
|
.. code:: bash
|
|
|
|
$ cd /path/to/paperless/src
|
|
$ pipenv run python3 manage.py my_command
|
|
|
|
You will also need to build the frontend each time a new update is pushed.
|
|
See updating paperless for more information. TODO REFERENCE
|
|
|
|
b. If you are using docker, build the docker image.
|
|
|
|
.. code:: bash
|
|
|
|
$ docker build -t jonaswinkler/paperless-ng:latest .
|
|
|
|
Copy either docker-compose.yml.example or docker-compose.yml.sqlite.example
|
|
to docker-compose.yml and adjust the consumption directory.
|
|
|
|
Management utilities
|
|
####################
|
|
|
|
Paperless comes with some management commands that perform various maintenance
|
|
tasks on your paperless instance. You can invoce these commands either by
|
|
|
|
.. code:: bash
|
|
|
|
$ cd /path/to/paperless
|
|
$ docker-compose run --rm webserver <command> <arguments>
|
|
|
|
or
|
|
|
|
.. code:: bash
|
|
|
|
$ cd /path/to/paperless/src
|
|
$ pipenv run python manage.py <command> <arguments>
|
|
|
|
depending on whether you use docker or not.
|
|
|
|
All commands have built-in help, which can be accessed by executing them with
|
|
the argument ``--help``.
|
|
|
|
.. _utilities-exporter:
|
|
|
|
Document exporter
|
|
=================
|
|
|
|
The document exporter exports all your data from paperless into a folder for
|
|
backup or migration to another DMS.
|
|
|
|
.. code::
|
|
|
|
document_exporter target
|
|
|
|
``target`` is a folder to which the data gets written. This includes documents,
|
|
thumbnails and a ``manifest.json`` file. The manifest contains all metadata from
|
|
the database (correspondents, tags, etc).
|
|
|
|
When you use the provided docker compose script, specify ``../export`` as the
|
|
target. This path inside the container is automatically mounted on your host on
|
|
the folder ``export``.
|
|
|
|
|
|
.. _utilities-importer:
|
|
|
|
Document importer
|
|
=================
|
|
|
|
The document importer takes the export produced by the `Document exporter`_ and
|
|
imports it into paperless.
|
|
|
|
The importer works just like the exporter. You point it at a directory, and
|
|
the script does the rest of the work:
|
|
|
|
.. code::
|
|
|
|
document_importer source
|
|
|
|
When you use the provided docker compose script, put the export inside the
|
|
``export`` folder in your paperless source directory. Specify ``../export``
|
|
as the ``source``.
|
|
|
|
|
|
.. _utilities-retagger:
|
|
|
|
Document retagger
|
|
=================
|
|
|
|
Say you've imported a few hundred documents and now want to introduce
|
|
a tag or set up a new correspondent, and apply its matching to all of
|
|
the currently-imported docs. This problem is common enough that
|
|
there are tools for it.
|
|
|
|
.. code::
|
|
|
|
document_retagger [-h] [-c] [-T] [-t] [-i] [--use-first] [-f]
|
|
|
|
optional arguments:
|
|
-c, --correspondent
|
|
-T, --tags
|
|
-t, --document_type
|
|
-i, --inbox-only
|
|
--use-first
|
|
-f, --overwrite
|
|
|
|
Run this after changing or adding matching rules. It'll loop over all
|
|
of the documents in your database and attempt to match documents
|
|
according to the new rules.
|
|
|
|
Specify any combination of ``-c``, ``-T`` and ``-t`` to have the
|
|
retagger perform matching of the specified metadata type. If you don't
|
|
specify any of these options, the document retagger won't do anything.
|
|
|
|
Specify ``-i`` to have the document retagger work on documents tagged
|
|
with inbox tags only. This is useful when you don't want to mess with
|
|
your already processed documents.
|
|
|
|
When multiple document types or correspondents match a single document,
|
|
the retagger won't assign these to the document. Specify ``--use-first``
|
|
to override this behaviour and just use the first correspondent or type
|
|
it finds. This option does not apply to tags, since any amount of tags
|
|
can be applied to a document.
|
|
|
|
Finally, ``-f`` specifies that you wish to overwrite already assigned
|
|
correspondents, types and/or tags. The default behaviour is to not
|
|
assign correspondents and types to documents that have this data already
|
|
assigned. ``-f`` works differently for tags: By default, only additional tags get
|
|
added to documents, no tags will be removed. With ``-f``, tags that don't
|
|
match a document anymore get removed as well.
|
|
|
|
|
|
Managing the Automatic matching algorithm
|
|
=========================================
|
|
|
|
The *Auto* matching algorithm requires a trained neural network to work.
|
|
This network needs to be updated whenever somethings in your data
|
|
changes. The docker image takes care of that automatically with the task
|
|
scheduler. You can manually renew the classifier by invoking the following
|
|
management command:
|
|
|
|
.. code::
|
|
|
|
document_create_classifier
|
|
|
|
This command takes no arguments.
|
|
|
|
|
|
Managing the document search index
|
|
==================================
|
|
|
|
The document search index is responsible for delivering search results for the
|
|
website. The document index is automatically updated whenever documents get
|
|
added to, changed, or removed from paperless. However, if the search yields
|
|
non-existing documents or won't find anything, you may need to recreate the
|
|
index manually.
|
|
|
|
.. code::
|
|
|
|
document_index {reindex,optimize}
|
|
|
|
Specify ``reindex`` to have the index created from scratch. This may take some
|
|
time.
|
|
|
|
Specify ``optimize`` to optimize the index. This updates certain aspects of
|
|
the index and usually makes queries faster and also ensures that the
|
|
autocompletion works properly. This command is regularly invoked by the task
|
|
scheduler.
|
|
|
|
|
|
Managing filenames
|
|
==================
|
|
|
|
If you use paperless' feature to assign custom filenames to your documents
|
|
(TODO ref), you can use this command to move all your files after changing
|
|
the naming scheme.
|
|
|
|
.. warning::
|
|
|
|
Since this command moves you documents around alot, it is advised to to
|
|
a backup before. The renaming logic is robust and will never overwrite
|
|
or delete a file, but you can't ever be careful enough.
|
|
|
|
.. code::
|
|
|
|
document_renamer
|
|
|
|
The command takes no arguments and processes all your documents at once.
|
|
|
|
|
|
.. _utilities-encyption:
|
|
|
|
Managing encrpytion
|
|
===================
|
|
|
|
Documents can be stored in Paperless using GnuPG encryption.
|
|
|
|
.. danger::
|
|
|
|
Decryption is depreceated since paperless-ng 1.0 and doesn't really provide any
|
|
additional security, since you have to store the passphrase in a configuration
|
|
file on the same system as the encrypted documents for paperless to work. Also,
|
|
paperless provides transparent access to your encrypted documents.
|
|
|
|
Consider running paperless on an encrypted filesystem instead, which will then
|
|
at least provide security against physical hardware theft.
|
|
|
|
.. code::
|
|
|
|
change_storage_type [--passphrase PASSPHRASE] {gpg,unencrypted} {gpg,unencrypted}
|
|
|
|
positional arguments:
|
|
{gpg,unencrypted} The state you want to change your documents from
|
|
{gpg,unencrypted} The state you want to change your documents to
|
|
|
|
optional arguments:
|
|
--passphrase PASSPHRASE
|
|
|
|
Enabling encryption
|
|
-------------------
|
|
|
|
Basic usage to enable encryption of your document store (**USE A MORE SECURE PASSPHRASE**):
|
|
|
|
(Note: If ``PAPERLESS_PASSPHRASE`` isn't set already, you need to specify it here)
|
|
|
|
.. code::
|
|
|
|
change_storage_type [--passphrase SECR3TP4SSPHRA$E] unencrypted gpg
|
|
|
|
|
|
Disabling encryption
|
|
--------------------
|
|
|
|
Basic usage to enable encryption of your document store:
|
|
|
|
(Note: Again, if ``PAPERLESS_PASSPHRASE`` isn't set already, you need to specify it here)
|
|
|
|
.. code::
|
|
|
|
change_storage_type [--passphrase SECR3TP4SSPHRA$E] gpg unencrypted
|
|
|
|
|
|
.. _Pipenv: https://pipenv.pypa.io/en/latest/ |