************** Administration ************** Making backups ############## .. warning:: This section is not updated yet. So you're bored of this whole project, or you want to make a remote backup of your files for whatever reason. This is easy to do, simply use the :ref:`exporter ` to dump your documents and database out into an arbitrary directory. .. _migrating-restoring: Restoring ========= .. _administration-updating: Updating paperless ################## For the most part, all you have to do to update Paperless is run ``git pull`` on the directory containing the project files, and then rebuild the docker image. .. code-block:: shell-session $ cd /path/to/paperless $ git pull If ``git pull`` doesn't report any changes, there is no need to continue with the remaining steps. After that, check if ``docker-compose.yml.example`` has changed. Update your ``docker-compose.yml`` file if necessary. .. code-block:: shell-session $ docker-compose down $ docker build -t jonaswinkler/paperless-ng . $ docker-compose up -d The docker image will take care of database migrations during startup. Updating paperless without docker ================================= Since paperless now involves a single page app that has to be built from source, updating paperless manually is somewhat more complicated. 1. Update python requirements. Paperless uses `Pipenv`_ for managing dependencies: .. code:: shell-session $ pip install --upgrade pipenv $ cd /path/to/paperless $ pipenv install This creates a new virtual environment (or uses your existing environment) and installs all dependencies into it. 2. You will also need to build the frontend each time a new update is pushed. You need `npm `_ for this. .. code:: shell-session $ cd src-ui $ npm install @angular/cli $ ng build --prod This will build the application and move the relevant files to a location within the django app (``src/documents/static/frontend``) at which django expects to find the files. 3. Collect static files, namely the newly created frontend files. .. code:: shell-session $ cd src $ pipenv run python3 manage.py collectstatic --clear 4. Migrate the database. .. code:: shell-session $ cd src $ pipenv run python3 manage.py migrate Management utilities #################### Paperless comes with some management commands that perform various maintenance tasks on your paperless instance. You can invoce these commands either by .. code:: bash $ cd /path/to/paperless $ docker-compose run --rm webserver or .. code:: bash $ cd /path/to/paperless/src $ pipenv run python manage.py depending on whether you use docker or not. All commands have built-in help, which can be accessed by executing them with the argument ``--help``. .. _utilities-exporter: Document exporter ================= The document exporter exports all your data from paperless into a folder for backup or migration to another DMS. .. code:: document_exporter target ``target`` is a folder to which the data gets written. This includes documents, thumbnails and a ``manifest.json`` file. The manifest contains all metadata from the database (correspondents, tags, etc). When you use the provided docker compose script, specify ``../export`` as the target. This path inside the container is automatically mounted on your host on the folder ``export``. .. _utilities-importer: Document importer ================= The document importer takes the export produced by the `Document exporter`_ and imports it into paperless. The importer works just like the exporter. You point it at a directory, and the script does the rest of the work: .. code:: document_importer source When you use the provided docker compose script, put the export inside the ``export`` folder in your paperless source directory. Specify ``../export`` as the ``source``. .. _utilities-retagger: Document retagger ================= Say you've imported a few hundred documents and now want to introduce a tag or set up a new correspondent, and apply its matching to all of the currently-imported docs. This problem is common enough that there are tools for it. .. code:: document_retagger [-h] [-c] [-T] [-t] [-i] [--use-first] [-f] optional arguments: -c, --correspondent -T, --tags -t, --document_type -i, --inbox-only --use-first -f, --overwrite Run this after changing or adding matching rules. It'll loop over all of the documents in your database and attempt to match documents according to the new rules. Specify any combination of ``-c``, ``-T`` and ``-t`` to have the retagger perform matching of the specified metadata type. If you don't specify any of these options, the document retagger won't do anything. Specify ``-i`` to have the document retagger work on documents tagged with inbox tags only. This is useful when you don't want to mess with your already processed documents. When multiple document types or correspondents match a single document, the retagger won't assign these to the document. Specify ``--use-first`` to override this behaviour and just use the first correspondent or type it finds. This option does not apply to tags, since any amount of tags can be applied to a document. Finally, ``-f`` specifies that you wish to overwrite already assigned correspondents, types and/or tags. The default behaviour is to not assign correspondents and types to documents that have this data already assigned. ``-f`` works differently for tags: By default, only additional tags get added to documents, no tags will be removed. With ``-f``, tags that don't match a document anymore get removed as well. Managing the Automatic matching algorithm ========================================= The *Auto* matching algorithm requires a trained neural network to work. This network needs to be updated whenever somethings in your data changes. The docker image takes care of that automatically with the task scheduler. You can manually renew the classifier by invoking the following management command: .. code:: document_create_classifier This command takes no arguments. Managing the document search index ================================== The document search index is responsible for delivering search results for the website. The document index is automatically updated whenever documents get added to, changed, or removed from paperless. However, if the search yields non-existing documents or won't find anything, you may need to recreate the index manually. .. code:: document_index {reindex,optimize} Specify ``reindex`` to have the index created from scratch. This may take some time. Specify ``optimize`` to optimize the index. This updates certain aspects of the index and usually makes queries faster and also ensures that the autocompletion works properly. This command is regularly invoked by the task scheduler. .. _utilities-renamer: Managing filenames ================== If you use paperless' feature to assign custom filenames to your documents (TODO ref), you can use this command to move all your files after changing the naming scheme. .. warning:: Since this command moves you documents around alot, it is advised to to a backup before. The renaming logic is robust and will never overwrite or delete a file, but you can't ever be careful enough. .. code:: document_renamer The command takes no arguments and processes all your documents at once. .. _utilities-encyption: Managing encryption =================== Documents can be stored in Paperless using GnuPG encryption. .. danger:: Decryption is depreceated since paperless-ng 1.0 and doesn't really provide any additional security, since you have to store the passphrase in a configuration file on the same system as the encrypted documents for paperless to work. Also, paperless provides transparent access to your encrypted documents. Consider running paperless on an encrypted filesystem instead, which will then at least provide security against physical hardware theft. .. code:: change_storage_type [--passphrase PASSPHRASE] {gpg,unencrypted} {gpg,unencrypted} positional arguments: {gpg,unencrypted} The state you want to change your documents from {gpg,unencrypted} The state you want to change your documents to optional arguments: --passphrase PASSPHRASE Enabling encryption ------------------- Basic usage to enable encryption of your document store (**USE A MORE SECURE PASSPHRASE**): (Note: If ``PAPERLESS_PASSPHRASE`` isn't set already, you need to specify it here) .. code:: change_storage_type [--passphrase SECR3TP4SSPHRA$E] unencrypted gpg Disabling encryption -------------------- Basic usage to enable encryption of your document store: (Note: Again, if ``PAPERLESS_PASSPHRASE`` isn't set already, you need to specify it here) .. code:: change_storage_type [--passphrase SECR3TP4SSPHRA$E] gpg unencrypted .. _Pipenv: https://pipenv.pypa.io/en/latest/