************** Administration ************** Making backups ############## .. warning:: This section is not updated yet. So you're bored of this whole project, or you want to make a remote backup of your files for whatever reason. This is easy to do, simply use the :ref:`exporter ` to dump your documents and database out into an arbitrary directory. .. _migrating-restoring: Restoring ========= Restoring your data is just as easy, since nearly all of your data exists either in the file names, or in the contents of the files themselves. You just need to create an empty database (just follow the :ref:`installation instructions ` again) and then import the ``tags.json`` file you created as part of your backup. Lastly, copy your exported documents into the consumption directory and start up the consumer. .. code-block:: shell-session $ cd /path/to/project $ rm data/db.sqlite3 # Delete the database $ cd src $ ./manage.py migrate # Create the database $ ./manage.py createsuperuser $ ./manage.py loaddata /path/to/arbitrary/place/tags.json $ cp /path/to/exported/docs/* /path/to/consumption/dir/ $ ./manage.py document_consumer Importing your data if you are :ref:`using Docker ` is almost as simple: .. code-block:: shell-session # Stop and remove your current containers $ docker-compose stop $ docker-compose rm -f # Recreate them, add the superuser $ docker-compose up -d $ docker-compose run --rm webserver createsuperuser # Load the tags $ cat /path/to/arbitrary/place/tags.json | docker-compose run --rm webserver loaddata_stdin - # Load your exported documents into the consumption directory # (How you do this highly depends on how you have set this up) $ cp /path/to/exported/docs/* /path/to/mounted/consumption/dir/ After loading the documents into the consumption directory the consumer will immediately start consuming the documents. .. _administration-updating: Updating paperless ################## .. warning:: This section is not updated yet. For the most part, all you have to do to update Paperless is run ``git pull`` on the directory containing the project files, and then use Django's ``migrate`` command to execute any database schema updates that might have been rolled in as part of the update: .. code-block:: shell-session $ cd /path/to/project $ git pull $ pip install -r requirements.txt $ cd src $ ./manage.py migrate Note that it's possible (even likely) that while ``git pull`` may update some files, the ``migrate`` step may not update anything. This is totally normal. Additionally, as new features are added, the ability to control those features is typically added by way of an environment variable set in ``paperless.conf``. You may want to take a look at the ``paperless.conf.example`` file to see if there's anything new in there compared to what you've got in ``/etc``. If you are :ref:`using Docker ` the update process is similar: .. code-block:: shell-session $ cd /path/to/project $ git pull $ docker build -t paperless . $ docker-compose run --rm consumer migrate $ docker-compose up -d If ``git pull`` doesn't report any changes, there is no need to continue with the remaining steps. This depends on the route you've chosen to run paperless. a. If you are not using docker, update python requirements. Paperless uses `Pipenv`_ for managing dependencies: .. code:: bash $ pip install --upgrade pipenv $ cd /path/to/paperless $ pipenv install This creates a new virtual environment (or uses your existing environment) and installs all dependencies into it. Running commands inside the environment is done via .. code:: bash $ cd /path/to/paperless/src $ pipenv run python3 manage.py my_command You will also need to build the frontend each time a new update is pushed. See updating paperless for more information. TODO REFERENCE b. If you are using docker, build the docker image. .. code:: bash $ docker build -t jonaswinkler/paperless-ng:latest . Copy either docker-compose.yml.example or docker-compose.yml.sqlite.example to docker-compose.yml and adjust the consumption directory. Management utilities #################### Paperless comes with some management commands that perform various maintenance tasks on your paperless instance. You can invoce these commands either by .. code:: bash $ cd /path/to/paperless $ docker-compose run --rm webserver or .. code:: bash $ cd /path/to/paperless/src $ pipenv run python manage.py depending on whether you use docker or not. All commands have built-in help, which can be accessed by executing them with the argument ``--help``. Document exporter ================= The document exporter exports all your data from paperless into a folder for backup or migration to another DMS. .. code:: document_exporter target ``target`` is a folder to which the data gets written. This includes documents, thumbnails and a ``manifest.json`` file. The manifest contains all metadata from the database (correspondents, tags, etc). When you use the provided docker compose script, specify ``../export`` as the target. This path inside the container is automatically mounted on your host on the folder ``export``. .. _utilities-importer: Document importer ================= The document importer takes the export produced by the `Document exporter`_ and imports it into paperless. The importer works just like the exporter. You point it at a directory, and the script does the rest of the work: .. code:: document_importer source When you use the provided docker compose script, put the export inside the ``export`` folder in your paperless source directory. Specify ``../export`` as the ``source``. .. _utilities-retagger: Document retagger ================= Say you've imported a few hundred documents and now want to introduce a tag or set up a new correspondent, and apply its matching to all of the currently-imported docs. This problem is common enough that there are tools for it. .. code:: document_retagger [-h] [-c] [-T] [-t] [-i] [--use-first] [-f] optional arguments: -c, --correspondent -T, --tags -t, --document_type -i, --inbox-only --use-first -f, --overwrite Run this after changing or adding matching rules. It'll loop over all of the documents in your database and attempt to match documents according to the new rules. Specify any combination of ``-c``, ``-T`` and ``-t`` to have the retagger perform matching of the specified metadata type. If you don't specify any of these options, the document retagger won't do anything. Specify ``-i`` to have the document retagger work on documents tagged with inbox tags only. This is useful when you don't want to mess with your already processed documents. When multiple document types or correspondents match a single document, the retagger won't assign these to the document. Specify ``--use-first`` to override this behaviour and just use the first correspondent or type it finds. This option does not apply to tags, since any amount of tags can be applied to a document. Finally, ``-f`` specifies that you wish to overwrite already assigned correspondents, types and/or tags. The default behaviour is to not assign correspondents and types to documents that have this data already assigned. ``-f`` works differently for tags: By default, only additional tags get added to documents, no tags will be removed. With ``-f``, tags that don't match a document anymore get removed as well. Managing the Automatic matching algorithm ========================================= The *Auto* matching algorithm requires a trained neural network to work. This network needs to be updated whenever somethings in your data changes. The docker image takes care of that automatically with the task scheduler. You can manually renew the classifier by invoking the following management command: .. code:: document_create_classifier This command takes no arguments. Managing the document search index ================================== The document search index is responsible for delivering search results for the website. The document index is automatically updated whenever documents get added to, changed, or removed from paperless. However, if the search yields non-existing documents or won't find anything, you may need to recreate the index manually. .. code:: document_index {reindex,optimize} Specify ``reindex`` to have the index created from scratch. This may take some time. Specify ``optimize`` to optimize the index. This updates certain aspects of the index and usually makes queries faster and also ensures that the autocompletion works properly. This command is regularly invoked by the task scheduler. Managing filenames ================== .. warning:: TBD .. code:: document_renamer .. _utilities-encyption: Managing encrpytion =================== Documents can be stored in Paperless using GnuPG encryption. .. danger:: Decryption is depreceated since paperless-ng 1.0 and doesn't really provide any additional security, since you have to store the passphrase in a configuration file on the same system as the encrypted documents for paperless to work. Also, paperless provides transparent access to your encrypted documents. Consider running paperless on an encrypted filesystem instead, which will then at least provide security against physical hardware theft. .. code:: change_storage_type [--passphrase PASSPHRASE] {gpg,unencrypted} {gpg,unencrypted} positional arguments: {gpg,unencrypted} The state you want to change your documents from {gpg,unencrypted} The state you want to change your documents to optional arguments: --passphrase PASSPHRASE Enabling encryption ------------------- Basic usage to enable encryption of your document store (**USE A MORE SECURE PASSPHRASE**): (Note: If ``PAPERLESS_PASSPHRASE`` isn't set already, you need to specify it here) .. code:: change_storage_type [--passphrase SECR3TP4SSPHRA$E] unencrypted gpg Disabling encryption -------------------- Basic usage to enable encryption of your document store: (Note: Again, if ``PAPERLESS_PASSPHRASE`` isn't set already, you need to specify it here) .. code:: change_storage_type [--passphrase SECR3TP4SSPHRA$E] gpg unencrypted .. _Pipenv: https://pipenv.pypa.io/en/latest/