Structs§
- Text
Processor - Configurable text preprocessing pipeline.
Functions§
- build_
vocab - Builds a vocabulary map and reverse lookup from tokenized sentences.
- build_
vocab_ with_ freq - Builds a vocabulary map, reverse lookup, and per-ID word frequencies from tokenized sentences.
- load_
text_ data - Tokenizes text using the default
TextProcessorsettings. - load_
text_ data_ advanced - Tokenizes text using a custom
TextProcessor.