Expand description
Dictionary builder support for gukhanmun-mkdict.
The crate owns parsers for normalized dictionary inputs and writers for the first on-disk FST and CDB dictionary formats. Runtime lookup is handled by backend crates.
Structs§
- Build
Options - Options controlling dictionary file construction.
- Dictionary
Entry - One normalized dictionary entry after parsing and merge handling.
- Entry
Mark - Dictionary-provided rendering constraints encoded in built files.
- Rule
- One row from a rules file: a selector that picks dictionary entries and the
mark bits to OR into their
EntryMark.
Enums§
- Dictionary
Format - The supported output backend format for this implementation step.
- Error
- Error returned while parsing inputs or building dictionary files.
- Merge
Policy - Conflict policy used when the same hanja key appears more than once.
- Rule
Kind - Selector kind used by a rules-file row.
Constants§
- DEFAULT_
MAX_ KEY_ BYTES - The maximum accepted UTF-8 key length when the CLI option is omitted.
Functions§
- apply_
rules - Applies parsed rules to dictionary entries by OR-merging their mark bits.
- build_
dictionary - Builds a dictionary file from normalized TSV, CSV, or JSONL inputs.
- parse_
metadata_ arg - Parses one
KEY=VALmetadata argument. - parse_
rules_ file - Parses a rules TSV file.
Type Aliases§
- Result
- Result type returned by dictionary builder APIs.