From a2c9f6792e052cc90928095ea51602087d14b732 Mon Sep 17 00:00:00 2001 From: Trenton H <797416+stumpylog@users.noreply.github.com> Date: Tue, 19 Sep 2023 07:46:13 -0700 Subject: [PATCH] Fixes the documentation for fuzzy matching (#4207) --- docs/advanced_usage.md | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/docs/advanced_usage.md b/docs/advanced_usage.md index 957d5287e..3980261e6 100644 --- a/docs/advanced_usage.md +++ b/docs/advanced_usage.md @@ -35,7 +35,8 @@ The following algorithms are available: (i.e. preserve ordering) in the PDF. - **Regular expression:** Parses the match as a regular expression and tries to find a match within the document. -- **Fuzzy match:** I don't know. Look at [the source](https://github.com/paperless-ngx/paperless-ngx/blob/main/src/documents/matching.py). +- **Fuzzy match:** Uses a partial matching based on locating the tag text + inside the document, using a [partial ratio](https://maxbachmann.github.io/RapidFuzz/Usage/fuzz.html#partial-ratio) - **Auto:** Tries to automatically match new documents. This does not require you to set a match. See the [notes below](#automatic-matching).