Function wana_kana::tokenize::tokenize_with_opt
Available on crate feature tokenize only.
Tokenizes the text. Splits the input into an array of strings, with token boundaries determined by an opinionated TokenType classification.
If compact is set, many same-language tokens are combined (spaces + text, kanji + kana, numeral + punctuation).
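To illustrate what combining tokens in compact mode means, here is a minimal standalone sketch. It is not the crate's implementation: `Kind`, `classify`, and `tokenize_sketch` are hypothetical names, the character ranges are deliberately simplified, and the real TokenType rules may differ. It assumes compact mode merges each fine-grained class into a coarser one (Japanese, English, or other) before grouping adjacent characters.

```rust
/// Coarse character classes used only by this sketch (not the crate's `TokenType`).
#[derive(Clone, Copy, PartialEq, Eq, Debug)]
enum Kind {
    En,
    Ja,
    Space,
    Kana,
    Kanji,
    Numeral,
    Punctuation,
    Other,
}

/// Classify one character. The ranges are simplified assumptions.
fn classify(c: char, compact: bool) -> Kind {
    let fine = match c {
        ' ' | '\u{3000}' => Kind::Space,
        '0'..='9' | '０'..='９' => Kind::Numeral,
        'a'..='z' | 'A'..='Z' => Kind::En,
        '\u{3041}'..='\u{3096}' | '\u{30A1}'..='\u{30FC}' => Kind::Kana,
        '\u{4E00}'..='\u{9FAF}' => Kind::Kanji,
        '!' | '?' | '.' | ',' | '。' | '、' | '！' | '？' => Kind::Punctuation,
        _ => Kind::Other,
    };
    if !compact {
        return fine;
    }
    // Compact mode: fold fine-grained classes into coarser ones.
    match fine {
        // A space joins the text of its own script.
        Kind::Space => {
            if c == ' ' {
                Kind::En
            } else {
                Kind::Ja
            }
        }
        Kind::Kana | Kind::Kanji => Kind::Ja,
        Kind::Numeral | Kind::Punctuation => Kind::Other,
        other => other,
    }
}

/// Group consecutive characters that share the same class into one token.
fn tokenize_sketch(input: &str, compact: bool) -> Vec<String> {
    let mut tokens: Vec<String> = Vec::new();
    let mut prev: Option<Kind> = None;
    for c in input.chars() {
        let kind = classify(c, compact);
        if prev == Some(kind) {
            // Same class as the previous character: extend the current token.
            if let Some(last) = tokens.last_mut() {
                last.push(c);
            }
        } else {
            // Class changed: start a new token.
            tokens.push(c.to_string());
        }
        prev = Some(kind);
    }
    tokens
}
```

With this sketch, `tokenize_sketch("漢字かな abc 12!", false)` yields seven tokens (kanji, kana, space, text, space, numerals, punctuation), while passing `true` yields three: the kanji and kana merge, the spaces join the Latin text, and the numerals join the punctuation.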