From 971192f6f396e2ef10fd848038ac6367e0a8abfd Mon Sep 17 00:00:00 2001 From: pewter77 Date: Fri, 30 Jul 2021 23:21:22 +0800 Subject: [PATCH 1/2] update configuration docs for languages Due to tesseract naming scheme, packages use '-' but the internal OCR uses '_' . This is to clarify in the documentation that this is the case for new users. --- docs/configuration.rst | 2 ++ 1 file changed, 2 insertions(+) diff --git a/docs/configuration.rst b/docs/configuration.rst index 62880208f..dd1e6341c 100644 --- a/docs/configuration.rst +++ b/docs/configuration.rst @@ -257,6 +257,8 @@ PAPERLESS_OCR_LANGUAGE= languages enabled. Defaults to "eng". + + Note: If your language contains a '-' such as chi-sim, you must use chi_sim PAPERLESS_OCR_MODE= Tell paperless when and how to perform ocr on your documents. Four modes From 45fd4e8565d8a73c1fb51d6d1cb4f29dd5b7a6ea Mon Sep 17 00:00:00 2001 From: jonaswinkler <17569239+jonaswinkler@users.noreply.github.com> Date: Sun, 22 Aug 2021 16:27:44 +0200 Subject: [PATCH 2/2] fix whitespace --- docs/configuration.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/configuration.rst b/docs/configuration.rst index dd1e6341c..bd3fe1a4f 100644 --- a/docs/configuration.rst +++ b/docs/configuration.rst @@ -257,7 +257,7 @@ PAPERLESS_OCR_LANGUAGE= languages enabled. Defaults to "eng". - + Note: If your language contains a '-' such as chi-sim, you must use chi_sim PAPERLESS_OCR_MODE=