Crate yeslogic_ucd_parse
source · [−]Expand description
A library for parsing the Unicode character database.
Modules
Structs for parsing files in the
extracted
subdirectory.Structs
A single row in the
DerivedAge.txt
file.Represents a single row in the
ArabicShaping.txt
file.Represents a single row in the
BidiMirroring.txt
file.A single row in the
CaseFolding.txt
file.A single Unicode codepoint.
An iterator over a range of Unicode codepoints.
A range of Unicode codepoints. The range is inclusive; both ends of the
range are guaranteed to be valid codepoints.
A single row in the
DerivedCoreProperties.txt
file.A single row in the
EastAsianWidth.txt
file, describing the value of the
East_Asian_Width
property.A single row in the
emoji-data.txt
file.Represents any kind of error that can occur while parsing the UCD.
A single row in the
auxiliary/GraphemeBreakProperty.txt
file.A single row in the
auxiliary/GraphemeBreakTest.txt
file.A single row in the
Jamo.txt
file.A single row in the
auxiliary/LineBreakTest.txt
file.A single row in the
NameAliases.txt
file.A single row in the
PropList.txt
file.A single row in the
PropertyAliases.txt
file.A single row in the
PropertyValueAliases.txt
file.A single row in the
Scripts.txt
file.A single row in the
ScriptExtensions.txt
file.A single row in the
auxiliary/SentenceBreakProperty.txt
file.A single row in the
auxiliary/SentenceBreakTest.txt
file.A single row in the
SpecialCasing.txt
file.A line oriented parser for a particular UCD file.
Represents a single row in the
UnicodeData.txt
file.Represents a decomposition mapping of a single row in the
UnicodeData.txt
file.An iterator adapter that expands rows in
UnicodeData.txt
.A single row in the
auxiliary/WordBreakProperty.txt
file.A single row in the
auxiliary/WordBreakTest.txt
file.Enums
The status of a particular case mapping.
A representation of either a single codepoint or a range of codepoints.
The kind of error that occurred while parsing the UCD.
The label of a name alias.
The formatting tag on a decomposition mapping.
A numeric value corresponding to characters with
Numeric_Type=Numeric
.Traits
Describes a single UCD file.
Describes a single UCD file where every record in the file is associated
with one or more codepoints.
Functions
Parse a particular file in the UCD into a sequence of rows.
Parse a particular file in the UCD into a map from codepoint to the record.
Parse a particular file in the UCD into a map from codepoint to all
records associated with that codepoint.
Given a path pointing at the root of the
ucd_dir
, attempts to determine
it’s unicode version.