Commit Graph

6 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Trenton H | b4e3334646 | Feature: Allow deletion of documents via the fuzzy matching command (#4957). Adds new flag allowing deletion of one of a document pair which is over the match ratio. Documents the new command option. | 2023-12-17 18:37:38 -08:00 |
| Trenton H | 7e56dfd1bd | Uses regex matching against the messages, as ordering and exact matched value don't matter, just the count of matches | 2023-09-12 08:59:09 -07:00 |
| Trenton H | 9e333bf91d | Cleans up the docs, adds validation of the process count, include the test descriptions | 2023-09-12 08:17:12 -07:00 |
| Trenton H | e82aaa7d85 | Adds multiprocessing to the fuzzy matching for a speedup | 2023-09-12 08:17:12 -07:00 |
| shamoon | 060966aa22 | Add progress bar to document_fuzzy_match | 2023-09-12 08:17:12 -07:00 |
| Trenton Holmes | e4bf3c6ac4 | Implements a new command for fuzzy matching document content and reporting potential duplicates | 2023-09-12 08:17:12 -07:00 |