mirror of
https://github.com/paperless-ngx/paperless-ngx.git
synced 2025-08-07 19:08:32 -05:00
Improves the docs: OCRing files in languages other than English + fixes typos
This commit is contained in:
18
docs/troubleshooting.rst
Normal file
18
docs/troubleshooting.rst
Normal file
@@ -0,0 +1,18 @@
|
||||
.. _troubleshooting:
|
||||
|
||||
Troubleshooting
|
||||
===============
|
||||
|
||||
.. _troubleshooting_ocr_language_files_missing:
|
||||
|
||||
Consumer warns ``OCR for XX failed``
|
||||
------------------------------------
|
||||
|
||||
If you find the OCR accuracy to be too low, and/or the document consumer warns that ``OCR for
|
||||
XX failed, but we're going to stick with what we've got since FORGIVING_OCR is enabled``, then you
|
||||
might need to install the `Tesseract language files
|
||||
<http://packages.ubuntu.com/search?keywords=tesseract-ocr>`_ marching your documents languages.
|
||||
|
||||
As an example, if your documents are written in Spanish you may need to run::
|
||||
|
||||
apt-get install -y tesseract-ocr-spa
|
Reference in New Issue
Block a user