Module fosslim::tokenizer

Functions

tokenize_overlapping_ngrams

Splits the original text into word n-grams; consecutive n-grams overlap by (n-1) elements.
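
To illustrate what overlapping by (n-1) elements means, here is a minimal sketch of a sliding window over whitespace-separated words. It is not taken from the fosslim source: the function name `overlapping_word_ngrams`, its signature, and its return type are assumptions made for illustration, and the real `tokenize_overlapping_ngrams` may normalize or split the text differently.

```rust
/// Hypothetical helper, NOT fosslim's actual API: returns every run of `n`
/// consecutive words, so neighbouring n-grams share (n-1) words.
fn overlapping_word_ngrams(text: &str, n: usize) -> Vec<Vec<&str>> {
    // `windows(0)` would panic, so a zero-sized n-gram yields nothing.
    if n == 0 {
        return Vec::new();
    }

    // Assumption: words are delimited by Unicode whitespace only.
    let words: Vec<&str> = text.split_whitespace().collect();

    // `windows(n)` advances one word at a time; it yields no windows
    // when the text has fewer than `n` words.
    words.windows(n).map(|window| window.to_vec()).collect()
}
```

With this sketch, calling `overlapping_word_ngrams("a b c d", 2)` produces the bigrams `["a", "b"]`, `["b", "c"]`, and `["c", "d"]`: each one shares a single word (n-1 = 1) with its neighbour.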

tokenize_whitespace