Improves the docs: OCRing files in languages other than English + fixes typos

Jaime Gómez
2016-03-21 21:57:36 +01:00
parent 840626e571
commit 8115cf8905
5 changed files with 22 additions and 3 deletions

docs/troubleshooting.rst Normal file

@@ -0,0 +1,18 @@
.. _troubleshooting:

Troubleshooting
===============

.. _troubleshooting_ocr_language_files_missing:

Consumer warns ``OCR for XX failed``
------------------------------------

If you find the OCR accuracy to be too low, and/or the document consumer warns that ``OCR for
XX failed, but we're going to stick with what we've got since FORGIVING_OCR is enabled``, then you
might need to install the `Tesseract language files
<http://packages.ubuntu.com/search?keywords=tesseract-ocr>`_ matching your documents' languages.
As an example, if your documents are written in Spanish, you may need to run::

    apt-get install -y tesseract-ocr-spa
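
Before installing anything, it can help to check which language files Tesseract already sees. The following is a minimal sketch, not part of the consumer itself: it assumes the ``tesseract`` binary is on your ``PATH`` and uses its ``--list-langs`` option. The ``has_lang`` helper is a hypothetical name introduced only for this example, and ``spa`` is used only because it matches the Spanish example above.

```shell
# Hypothetical helper: succeeds if the language code in $1 appears
# as a complete line in the list read from standard input.
has_lang() {
    grep -qx -- "$1"
}

if command -v tesseract >/dev/null 2>&1; then
    # Some Tesseract versions print the language list on stderr,
    # so both streams are merged before searching.
    if tesseract --list-langs 2>&1 | has_lang spa; then
        echo "spa language files are installed"
    else
        echo "spa language files are missing; try: apt-get install -y tesseract-ocr-spa"
    fi
else
    echo "tesseract was not found on PATH"
fi
```

Replace ``spa`` with the code for your own documents' language. If the code is reported as missing, installing the corresponding ``tesseract-ocr-*`` package as shown above may resolve the warning.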