Skip to main content

Crate gukhanmun_mkdict

Crate gukhanmun_mkdict 

Source
Expand description

Dictionary builder support for gukhanmun-mkdict.

The crate owns parsers for normalized dictionary inputs and writers for the first on-disk FST and CDB dictionary formats. Runtime lookup is handled by backend crates.

Structs§

BuildOptions
Options controlling dictionary file construction.
DictionaryEntry
One normalized dictionary entry after parsing and merge handling.
EntryMark
Dictionary-provided rendering constraints encoded in built files.
Rule
One row from a rules file: a selector that picks dictionary entries and the mark bits to OR into their EntryMark.

Enums§

DictionaryFormat
The supported output backend format for this implementation step.
Error
Error returned while parsing inputs or building dictionary files.
MergePolicy
Conflict policy used when the same hanja key appears more than once.
RuleKind
Selector kind used by a rules-file row.

Constants§

DEFAULT_MAX_KEY_BYTES
The maximum accepted UTF-8 key length when the CLI option is omitted.

Functions§

apply_rules
Applies parsed rules to dictionary entries by OR-merging their mark bits.
build_dictionary
Builds a dictionary file from normalized TSV, CSV, or JSONL inputs.
parse_metadata_arg
Parses one KEY=VAL metadata argument.
parse_rules_file
Parses a rules TSV file.

Type Aliases§

Result
Result type returned by dictionary builder APIs.