diff roundup/backends/indexer_common.py @ 6599:39189dd94f2c

issue2551189 - increase size of words in full text index. Increased indexed word maxlength to 50 DB migration code is written and tests work. Restructured some tests to allow for code reuse. Docs. If this passes CI without errors 2551189 should be done. However, testing on my system generates errors. Encoding (indexer unicode russian unicode string invalid) and collation errors (utf8_bin not valid) when running under python2. No issues with python3 and I haven't changed code that should cause these since the last successful build in CI. So if this fails in CI we will have more checkins.
author John Rouillard <rouilj@ieee.org>
date Wed, 26 Jan 2022 15:04:09 -0500
parents 91ab3e0ffcd0
children 3260926d7e7e
line wrap: on
line diff
--- a/roundup/backends/indexer_common.py	Wed Jan 26 08:58:46 2022 -0500
+++ b/roundup/backends/indexer_common.py	Wed Jan 26 15:04:09 2022 -0500
@@ -19,10 +19,11 @@
         self.stopwords = set(STOPWORDS)
         for word in db.config[('main', 'indexer_stopwords')]:
             self.stopwords.add(word)
-        # Do not index anything longer than 25 characters since that'll be
-        # gibberish (encoded text or somesuch) or shorter than 2 characters
+        # Do not index anything longer than maxlength characters since
+        # that'll be gibberish (encoded text or somesuch) or shorter
+        # than 2 characters
         self.minlength = 2
-        self.maxlength = 25
+        self.maxlength = 50
         self.language = db.config[('main','indexer_language')]
         # Some indexers have a query language. If that is the case,
         # we don't parse the user supplied query into a wordlist.

Roundup Issue Tracker: http://roundup-tracker.org/