Crate opencc_fmmseg
- dictionary_lib
- Dictionary utilities for managing multiple OpenCC lexicons.
Internal dictionary-processing utilities for opencc-fmmseg.
- debug_note
- Print a developer note to stderr in debug builds; no-op in release.
- DelimiterSet
- Compact, hot-path-friendly delimiter set optimized for per-character
membership tests.
- DictRefs
- Holds up to three conversion rounds. Each round carries its own dictionaries,
max_len, and prebuilt StarterUnion.
- OpenCC
- Central interface for performing OpenCC-based conversion with segmentation.
- OpenccConfig
- OpenCC conversion configuration (strongly-typed).
- find_max_utf8_len_bytes
- Finds a safe UTF-8 boundary within a raw byte slice, limited by a maximum byte count.
- find_max_utf8_length
- Finds a valid UTF-8 boundary within the given string, limited by a maximum byte count.
- for_each_len_dec
- Iterates viable phrase lengths in descending order using a starter bitmask,
stopping early if the callback returns true.
- is_delimiter
- Checks whether a character is treated as a segmentation delimiter.
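
Two of the helpers listed above describe techniques that are easy to illustrate in isolation: backing off to a UTF-8 boundary under a byte limit, and walking candidate phrase lengths from longest to shortest via a bitmask. The following is a minimal sketch of both ideas, not the crate's implementation. The function names `utf8_boundary_at_most` and `for_each_len_desc`, their signatures, and the mask layout (bit `len - 1` set means a phrase of `len` characters may exist) are assumptions made for illustration only.

```rust
/// Hypothetical helper (not the crate's code): returns the largest byte
/// index `<= max_bytes` that lies on a UTF-8 character boundary of `s`.
fn utf8_boundary_at_most(s: &str, max_bytes: usize) -> usize {
    if max_bytes >= s.len() {
        return s.len();
    }
    let mut end = max_bytes;
    // Index 0 is always a character boundary, so this loop terminates.
    while !s.is_char_boundary(end) {
        end -= 1;
    }
    end
}

/// Hypothetical helper (not the crate's code): visits candidate phrase
/// lengths from longest to shortest. Assumed layout: bit `len - 1` of
/// `mask` is set when a phrase of `len` characters may exist. Lengths
/// above `cap` are skipped. Returns `true` if the callback stopped early.
fn for_each_len_desc(mask: u64, cap: usize, mut f: impl FnMut(usize) -> bool) -> bool {
    let cap = cap.min(64);
    // Keep only the bits that encode lengths 1..=cap.
    let mut m = if cap == 64 { mask } else { mask & ((1u64 << cap) - 1) };
    while m != 0 {
        // Position of the highest set bit, i.e. the longest remaining length.
        let bit = 63 - m.leading_zeros() as usize;
        if f(bit + 1) {
            return true;
        }
        // Clear that bit and continue with the next shorter length.
        m &= !(1u64 << bit);
    }
    false
}
```

With these definitions, `utf8_boundary_at_most("漢字", 4)` yields `3`, because each of the two characters occupies three bytes and byte index 4 falls inside the second one. Trying the longest length first matches the forward-maximum-matching style of segmentation, where the first dictionary hit at a given position is also the longest one.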