charabia 0.8.8

A simple library to detect the language of a text, tokenize it, and normalize the tokens.

Little structured metadata is currently available to build this page from. Check the main library documentation, the README, or Cargo.toml in case the author documented the features there.

This version has 14 feature flags, 11 of which are enabled by default.
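
As a rough illustration of how these flags are selected, a dependency entry in a downstream Cargo.toml might look like the sketch below. It turns off the default set and opts back into two of the flags listed on this page. The particular selection is only an example, and whether the crate builds and behaves as wanted with exactly this subset is an assumption to check against the crate's own Cargo.toml.

```toml
[dependencies]
# Illustrative selection only: disable the default set, then re-enable
# just the flags this project needs (names taken from the list on this page).
charabia = { version = "0.8.8", default-features = false, features = ["greek", "hebrew"] }
```

Leaving out `default-features = false` keeps all 11 default flags active, and anything listed under `features` is added on top of them.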

default

chinese (default)

greek (default)

This feature flag does not enable additional features.

hebrew (default)

This feature flag does not enable additional features.

japanese (default)

khmer (default)

This feature flag does not enable additional features.

korean (default)

latin-camelcase (default)

latin-snakecase (default)

thai (default)

This feature flag does not enable additional features.

vietnamese (default)

This feature flag does not enable additional features.

japanese-segmentation-unidic (default)

japanese-segmentation-ipadic

japanese-transliteration

lindera-tokenizer
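
The last three flags above (japanese-segmentation-ipadic, japanese-transliteration, lindera-tokenizer) are not marked as default, so they must be requested explicitly. A minimal sketch of opting into one of them while keeping the default set follows. The choice of japanese-transliteration is illustrative, and this page does not say what the flag pulls in or how it interacts with the other flags, so treat the combination as an assumption to verify.

```toml
[dependencies]
# Default flags stay enabled; the listed flag is added on top of them.
# Which non-default flag to pick is a hypothetical choice for illustration.
charabia = { version = "0.8.8", features = ["japanese-transliteration"] }
```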