Expand description
Chinese pinyin and CC-CEDICT adapters.
Structs§
- Cedict
Index Options - Options used while building a CC-CEDICT reading index.
- Cedict
Reading Index - CC-CEDICT-derived surface-to-pinyin reading index.
- Chinese
Distance - Distances computed for one Chinese comparison.
- Loaded
Bundle - Verified Chinese dictionary bundle ready for scoring.
- Pinyin
Reading Expansion - Reading-path expansion result plus pruning statistics.
- Pinyin
Reading Options - Controls Chinese dictionary reading-path expansion.
- Pinyin
Reading Path - One complete segmentation and joined pinyin reading for an input string.
- Pinyin
Reading Segment - One Chinese surface segment and its selected pinyin reading.
- Pinyin
Reading Stats - Counters describing Chinese reading-path expansion.
- ZhArtifact
Build - Build-time settings recorded in Chinese artifact metadata.
- ZhArtifact
License - License metadata for a Chinese dictionary artifact.
- ZhArtifact
License Reference - One license reference stored in Chinese artifact metadata.
- ZhArtifact
Metadata - Metadata stored in a Chinese dictionary bundle.
- ZhArtifact
Metadata Options - Inputs used to build Chinese artifact metadata from an index.
- ZhArtifact
Payload - Payload metadata stored in a Chinese dictionary bundle.
- ZhArtifact
Query Defaults - Default reading expansion options recorded in Chinese artifact metadata.
- ZhArtifact
Source - Source dictionary metadata for a Chinese artifact.
- ZhIndexed
Artifact Payload Header - Header for indexed FST Chinese payloads.
- ZhReading
Index Payload - Normalized Chinese reading-index payload.
- ZhReading
Index Payload Entry - One surface form and its normalized pinyin readings.
Enums§
- Bundle
Load Error - Errors returned when loading and verifying a Chinese dictionary bundle.
- Cedict
Error - Errors returned while parsing CC-CEDICT source text.
- CnLattice
Error - Errors returned while building Chinese pinyin lattices.
- Pinyin
View - Pinyin representation used by a Chinese reading index.
- ZhArtifact
Payload Error - Errors returned while loading or validating Chinese artifact payloads.
Constants§
- ARTIFACT_
PAYLOAD_ CHECKSUM_ ALGORITHM - Current canonical checksum algorithm for normalized Chinese payload content.
- ARTIFACT_
PAYLOAD_ FILE_ DIGEST_ ALGORITHM - File digest algorithm used to verify payload bytes before loading.
Functions§
- artifact_
file_ digest_ path - Computes the SHA-256 file digest string for a Chinese artifact payload file.
- artifact_
file_ digest_ reader - Computes the SHA-256 file digest string from a reader.
- cedict_
or_ direct_ lattice - Builds a pinyin lattice from direct input, CC-CEDICT readings, or both.
- compare_
with_ cedict_ index - Compares two strings using direct pinyin handling and a CC-CEDICT index.
- compare_
with_ zh_ index - Compares two strings using direct pinyin handling and a Chinese index.
- load_
bundle - Loads and verifies a Chinese dictionary bundle.
- normalize_
pinyin - Normalizes a whitespace-separated CC-CEDICT pinyin field.
- normalized_
similarity_ with_ zh_ index - Computes the best normalized similarity across Chinese pinyin readings.
- pinyin_
lattice_ from_ reading_ paths - Builds a pinyin lattice from expanded reading paths.
- zh_
or_ direct_ lattice - Builds a pinyin lattice from direct input, dictionary readings, or both.
- zh_
or_ direct_ pinyin_ paths - Returns pinyin paths from direct input, dictionary readings, or both.
Type Aliases§
- ZhReading
Index - Public alias for the Chinese reading index type.