Skip to main contentCrate opencc_fmmseg
Source - dictionary_lib
- Dictionary utilities for managing multiple OpenCC lexicons.
Internal dictionary-processing utilities for
opencc-fmmseg.
- debug_note
- Print a developer note to stderr in debug builds; no-op in release.
- DelimiterSet
- Convenience helper for hot paths: tests if a
char is a delimiter using
the global FULL_DELIMITER_SET. - DictRefs
- Holds up to three conversion rounds. Each round carries its own
dictionaries,
max_len, and prebuilt StarterUnion. - OpenCC
- Central interface for performing OpenCC-based conversion with segmentation.
- OpenccConfig
- OpenCC conversion configuration (strongly-typed).
- FULL_DELIMITER_SET
- Global static instance of the
DelimiterSet constructed from
[FULL_DELIMITERS].
- find_max_utf8_len_bytes
- Finds a safe UTF-8 boundary within a raw byte slice, limited by a maximum byte count.
- find_max_utf8_length
- Finds a valid UTF-8 boundary within the given string, limited by a maximum byte count.
- for_each_len_dec
- Iterates viable phrase lengths in descending order using a starter bitmask,
stopping early if the callback returns
true. - is_delimiter
- Checks whether a character is treated as a segmentation delimiter.