Tsonga language model for Lingua
This is the language model for the Tsonga language which is used by Lingua, the most accurate natural language detection library in the Rust ecosystem.
Changelog
Version 1.3.0
- The language model files have been converted into a new storage format. They are now stored as finite-state transducers (FSTs) which reduces memory consumption drastically at the cost of a slightly slower runtime performance.
Version 1.2.0
- The language model has been enhanced by including unique and most common ngrams to support an absolute confidence metric which is independent of other languages.
Version 1.1.0
- The language model files are now compressed with the Brotli algorithm which reduces the file size by 15 %, on average.