From be2b59431f976deac8df9f71ea8f1c340f815da8 Mon Sep 17 00:00:00 2001 From: Alexey Zinchenko Date: Wed, 30 Mar 2022 13:27:09 +0300 Subject: [PATCH] Update troubleshooting.rst --- docs/troubleshooting.rst | 13 +++++++++++++ 1 file changed, 13 insertions(+) diff --git a/docs/troubleshooting.rst b/docs/troubleshooting.rst index f56d2cd09..3ae4909de 100644 --- a/docs/troubleshooting.rst +++ b/docs/troubleshooting.rst @@ -25,6 +25,19 @@ Check for the following issues: * Go to the admin interface, and check if there are failed tasks. If so, the tasks will contain an error message. +Consumer warns ``OCR for XX failed`` +#################################### + +If you find the OCR accuracy to be too low, and/or the document consumer warns +that ``OCR for XX failed, but we're going to stick with what we've got since +FORGIVING_OCR is enabled``, then you might need to install the +`Tesseract language files `_ +marching your document's languages. + +As an example, if you are running Paperless-ngx from any Ubuntu or Debian +box, and your documents are written in Spanish you may need to run:: + + apt-get install -y tesseract-ocr-spa Consumer fails to pickup any new files ######################################