charabia 0.7.0

A simple library to detect the language, tokenize the text and normalize the tokens

Very little structured metadata is available to build this page from. Check the main library docs, the README, or Cargo.toml in case the author documented the features there.

This version has 11 feature flags, 6 of them enabled by default (the default flag itself plus the five flags it lists).

default

  • chinese
  • hebrew
  • japanese
  • thai
  • korean
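
Because every language flag above is part of default, a downstream crate that needs only some of them has to opt out of the default set first. The fragment below is a minimal sketch of such a manifest entry; the choice of chinese and japanese is only an illustration, and the surrounding Cargo.toml belongs to a hypothetical downstream project, not to charabia itself.

```toml
[dependencies]
# Hypothetical downstream manifest: drop the default set, then re-enable
# only the language flags this project needs (chosen here for illustration).
charabia = { version = "0.7.0", default-features = false, features = ["chinese", "japanese"] }
```

With this entry, hebrew, thai and korean stay disabled, and the dependencies that only korean pulls in (lindera/ko-dic) are not requested.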

chinese

  • dep:pinyin
  • dep:jieba-rs

hebrew

    This feature flag does not enable additional features.

japanese

  • lindera/ipadic

thai

    This feature flag does not enable additional features.

korean

  • lindera/ko-dic

japanese-transliteration

  • dep:wana_kana
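
japanese-transliteration is not listed under default, so it has to be requested explicitly. A minimal sketch of a downstream manifest entry that keeps the default flags and adds this one is shown below; the surrounding Cargo.toml is a hypothetical downstream project, and this page does not document what the flag changes at runtime beyond enabling the wana_kana dependency.

```toml
[dependencies]
# Hypothetical downstream manifest: default flags stay enabled because
# default-features is left untouched; the optional flag is added on top.
charabia = { version = "0.7.0", features = ["japanese-transliteration"] }
```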

wana_kana

    This feature flag does not enable additional features.

jieba-rs

    This feature flag does not enable additional features.

lindera

    This feature flag does not enable additional features.

pinyin

    This feature flag does not enable additional features.