Adds step to bare metal setup regarding downloading the required NLTK data

This commit is contained in:
Trenton H 2022-10-05 12:19:07 -07:00
parent d08eb0c66b
commit dafefa33d6
2 changed files with 8 additions and 1 deletions

2
Pipfile.lock generated
View File

@ -1,7 +1,7 @@
{
"_meta": {
"hash": {
"sha256": "79ef8a0dae2a57c93935fa6ee7c591b53a64cf8c6925d16dc95aa8f8a937f9c7"
"sha256": "d73f1cfcc2ea4741e109317a991b271db292e708c119c560c523b0c74401d5fb"
},
"pipfile-spec": 6,
"requires": {},

View File

@ -520,6 +520,13 @@ writing. Windows is not and will never be supported.
to compile this by yourself, because this software has been patented until around 2017 and
binary packages are not available for most distributions.
15. Optional: If using the NLTK machine learning processing (see ``PAPERLESS_ENABLE_NLTK`` in
:ref:`configuration` for details), download the NLTK data for the Snowball Stemmer, Stopwords
and Punkt tokenizer to your ``PAPERLESS_DATA_DIR/nltk``. Refer to
the `NLTK instructions <https://www.nltk.org/data.html>`_ for details on how to
download the data.
Migrating to Paperless-ngx
##########################