rten-text 0.24.0

Text tokenization and other ML pre/post-processing functions
# rten-text

Library providing text tokenization and related functionality for preparing
inputs to, and decoding outputs from, text models (e.g. BERT).

The functionality is a subset of that found in [Hugging Face
Tokenizers](https://github.com/huggingface/tokenizers). In exchange, it has
fewer dependencies, and none that require C/C++.