Skip to main content

Crate opencc_fmmseg

Crate opencc_fmmseg 

Source

Modules§

dictionary_lib
Dictionary utilities for managing multiple OpenCC lexicons. Internal dictionary-processing utilities for opencc-fmmseg.

Macros§

debug_note
Print a developer note to stderr in debug builds; no-op in release.

Structs§

DelimiterSet
Convenience helper for hot paths: tests if a char is a delimiter using the global FULL_DELIMITER_SET.
DictRefs
Holds up to three conversion rounds. Each round carries its own dictionaries, max_len, and prebuilt StarterUnion.
OpenCC
Central interface for performing OpenCC-based conversion with segmentation.

Enums§

OpenccConfig
OpenCC conversion configuration (strongly-typed).

Statics§

FULL_DELIMITER_SET
Global static instance of the DelimiterSet constructed from [FULL_DELIMITERS].

Functions§

find_max_utf8_len_bytes
Finds a safe UTF-8 boundary within a raw byte slice, limited by a maximum byte count.
find_max_utf8_length
Finds a valid UTF-8 boundary within the given string, limited by a maximum byte count.
for_each_len_dec
Iterates viable phrase lengths in descending order using a starter bitmask, stopping early if the callback returns true.
is_delimiter
Checks whether a character is treated as a segmentation delimiter.