From e6b856e13f1de2daaf7887a8457044a78bacead5 Mon Sep 17 00:00:00 2001
From: shamoon <4887959+shamoon@users.noreply.github.com>
Date: Thu, 23 May 2024 12:52:40 -0700
Subject: [PATCH] Documentation: Add PAPERLESS_OCR_LANGUAGE config note (#6821)

---
 docs/configuration.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/docs/configuration.md b/docs/configuration.md
index bbe46f4de..254508f31 100644
--- a/docs/configuration.md
+++ b/docs/configuration.md
@@ -616,6 +616,8 @@ parsing documents.
     Keep in mind that Tesseract uses much more CPU time with multiple
     languages enabled.
 
+    If you are including languages that are not installed by default, you will need to also set [`PAPERLESS_OCR_LANGUAGES`](configuration.md#PAPERLESS_OCR_LANGUAGES) for docker deployments or install the tesseract language packages manually for bare metal installations.
+
     Defaults to "eng".
 
     !!! note
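
Reviewer note: the sentence added by this patch may be easier to follow with a concrete pairing of the two settings. The fragment below is only a sketch for a Docker deployment. The service name `webserver` and the language codes `tur` and `ces` are illustrative assumptions, and the space-separated format of `PAPERLESS_OCR_LANGUAGES` is assumed from the project documentation rather than shown in this patch.

```yaml
# Hypothetical docker compose fragment; the service name and languages are examples only.
services:
  webserver:
    environment:
      # Assumed format: space-separated language packs to install in the container.
      PAPERLESS_OCR_LANGUAGES: "tur ces"
      # Languages Tesseract should use for OCR, joined with "+".
      PAPERLESS_OCR_LANGUAGE: "tur+ces"
```

On a bare metal installation, the first setting would not apply. The matching Tesseract language packages would instead be installed through the system package manager, as the added note describes.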