Add ability to boost likelihood of specific languages
Add the ability to specify a list of languages to have their
scores boosted, and an amount (as a percentage) to boost them.
Tidy up and fix errors in help text.
README.md (1 addition, 0 deletions)
@@ -25,6 +25,7 @@ Updates from the original version include:
* Allow specification of a minimum input length (-j); shorter strings will not be identified. Minimum length does not count non-word characters.
* Allow specification of a maximum proportion of highest (i.e., worst) possible score (-p), to filter "junk" texts mostly made of unknown characters and n-grams, and to a lesser extent texts in languages that are not even similar to the models in use.
* Merge n-gram count for input text and language model size to one shared value.
+* Allow boosting of particular languages in results (based, for example, on a priori knowledge of the likelihood of various languages being present).
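
For illustration only, here is a minimal Python sketch of how a percentage boost could be applied to per-language scores. It is not the code from this commit, and the diff above does not show the actual formula. The sketch assumes that lower scores are better (consistent with the README's "highest (i.e., worst) possible score") and that a boost of N percent reduces a language's score by N percent. The names `apply_boost`, `rank`, `boosted_langs`, and `boost_pct` are hypothetical.

```python
def apply_boost(scores, boosted_langs, boost_pct):
    """Return a copy of `scores` with boosted languages' scores reduced.

    Assumption: lower scores are better, so "boosting" a language by
    boost_pct percent multiplies its score by (1 - boost_pct / 100).
    """
    if not 0 <= boost_pct <= 100:
        raise ValueError("boost_pct must be between 0 and 100")
    factor = 1.0 - boost_pct / 100.0
    return {
        lang: score * factor if lang in boosted_langs else score
        for lang, score in scores.items()
    }


def rank(scores):
    """Return language codes ordered from best (lowest) to worst score."""
    return sorted(scores, key=scores.get)


# Hypothetical scores for one input string; not real model output.
scores = {"en": 1000.0, "fr": 950.0, "de": 1200.0}
boosted = apply_boost(scores, {"en"}, 10)

print(rank(scores))   # ranking without any boost
print(rank(boosted))  # ranking after boosting "en" by 10 percent
```

In this made-up example, "fr" ranks first without a boost, while a 10 percent boost to "en" moves it ahead of "fr". This is the kind of effect a priori knowledge about the expected languages is meant to produce.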