Skip to main content

Module dictionary

Module dictionary 

Source

Structs§

Dictionary
Dictionary holding vocabulary, tokenization logic, and open-addressing hash table.
Entry
A vocabulary entry.

Enums§

EntryType
Word entry type.

Constants§

BOW
Beginning-of-word marker for subword computation.
EOS
EOS (end-of-sentence) token string.
EOW
End-of-word marker for subword computation.
MAX_LINE_SIZE
Maximum tokens per line.
MAX_VOCAB_SIZE
Maximum vocabulary size for the open-addressing hash table.