From 3ed6d4bc7a9873f7db6a4ca8348e3c80505c9097 Mon Sep 17 00:00:00 2001 From: Alexey Zinchenko Date: Wed, 30 Mar 2022 13:27:09 +0300 Subject: [PATCH] Update troubleshooting.rst --- docs/troubleshooting.rst | 13 +++++++++++++ 1 file changed, 13 insertions(+) diff --git a/docs/troubleshooting.rst b/docs/troubleshooting.rst index f3ea612a0..c5c669b18 100644 --- a/docs/troubleshooting.rst +++ b/docs/troubleshooting.rst @@ -25,6 +25,19 @@ Check for the following issues: * Go to the admin interface, and check if there are failed tasks. If so, the tasks will contain an error message. +Consumer warns ``OCR for XX failed`` +#################################### + +If you find the OCR accuracy to be too low, and/or the document consumer warns +that ``OCR for XX failed, but we're going to stick with what we've got since +FORGIVING_OCR is enabled``, then you might need to install the +`Tesseract language files `_ +marching your document's languages. + +As an example, if you are running Paperless-ngx from any Ubuntu or Debian +box, and your documents are written in Spanish you may need to run:: + + apt-get install -y tesseract-ocr-spa Consumer fails to pickup any new files ######################################