Struct ucd_parse::UnicodeData
[−]
[src]
pub struct UnicodeData {
pub codepoint: Codepoint,
pub name: String,
pub general_category: String,
pub canonical_combining_class: u8,
pub bidi_class: String,
pub decomposition: UnicodeDataDecomposition,
pub numeric_type_decimal: Option<u8>,
pub numeric_type_digit: Option<u8>,
pub numeric_type_numeric: Option<UnicodeDataNumeric>,
pub bidi_mirrored: bool,
pub unicode1_name: String,
pub iso_comment: String,
pub simple_uppercase_mapping: Option<Codepoint>,
pub simple_lowercase_mapping: Option<Codepoint>,
pub simple_titlecase_mapping: Option<Codepoint>,
}Represents a single row in the UnicodeData.txt file.
These fields were taken from UAX44, Table 9, as part of the documentation
for the
UnicodeData.txt file.
Fields
codepoint: Codepoint
The codepoint corresponding to this row.
name: String
The name of this codepoint.
general_category: String
The "general category" of this codepoint.
canonical_combining_class: u8
The class of this codepoint used in the Canonical Ordering Algorithm.
Note that some classes map to a particular symbol. See UAX44, Table 15.
bidi_class: String
The bidirectional class of this codepoint.
Possible values are listed in UAX44, Table 13.
decomposition: UnicodeDataDecomposition
The decomposition mapping for this codepoint. This includes its formatting tag (if present).
numeric_type_decimal: Option<u8>
A decimal numeric representation of this codepoint, if it has the
property Numeric_Type=Decimal.
numeric_type_digit: Option<u8>
A decimal numeric representation of this codepoint, if it has the
property Numeric_Type=Digit. Note that while this field is still
populated for existing codepoints, no new codepoints will have this
field populated.
numeric_type_numeric: Option<UnicodeDataNumeric>
A decimal or rational numeric representation of this codepoint, if it
has the property Numeric_Type=Numeric.
bidi_mirrored: bool
A boolean indicating whether this codepoint is "mirrored" in bidirectional text.
unicode1_name: String
The "old" Unicode 1.0 or ISO 6429 name of this codepoint. Note that
this field is empty unless it is significantly different from
the name field.
iso_comment: String
The ISO 10464 comment field. This no longer contains any non-NULL values.
simple_uppercase_mapping: Option<Codepoint>
This codepoint's simple uppercase mapping, if it exists.
simple_lowercase_mapping: Option<Codepoint>
This codepoint's simple lowercase mapping, if it exists.
simple_titlecase_mapping: Option<Codepoint>
This codepoint's simple titlecase mapping, if it exists.
Methods
impl UnicodeData[src]
pub fn is_range_start(&self) -> bool[src]
Returns true if and only if this record corresponds to the start of a range.
pub fn is_range_end(&self) -> bool[src]
Returns true if and only if this record corresponds to the end of a range.
Trait Implementations
impl Clone for UnicodeData[src]
fn clone(&self) -> UnicodeData[src]
Returns a copy of the value. Read more
fn clone_from(&mut self, source: &Self)1.0.0[src]
Performs copy-assignment from source. Read more
impl Debug for UnicodeData[src]
fn fmt(&self, __arg_0: &mut Formatter) -> Result[src]
Formats the value using the given formatter. Read more
impl Default for UnicodeData[src]
fn default() -> UnicodeData[src]
Returns the "default value" for a type. Read more
impl Eq for UnicodeData[src]
impl PartialEq for UnicodeData[src]
fn eq(&self, __arg_0: &UnicodeData) -> bool[src]
This method tests for self and other values to be equal, and is used by ==. Read more
fn ne(&self, __arg_0: &UnicodeData) -> bool[src]
This method tests for !=.
impl UcdFile for UnicodeData[src]
fn relative_file_path() -> &'static Path[src]
The file path corresponding to this file, relative to the UCD directory. Read more
fn file_path<P: AsRef<Path>>(ucd_dir: P) -> PathBuf[src]
The full file path corresponding to this file given the UCD directory path. Read more
fn from_dir<P: AsRef<Path>>(
ucd_dir: P
) -> Result<UcdLineParser<File, Self>, Error>[src]
ucd_dir: P
) -> Result<UcdLineParser<File, Self>, Error>
Create an iterator over each record in this UCD file. Read more
impl UcdFileByCodepoint for UnicodeData[src]
ⓘImportant traits for CodepointIterfn codepoints(&self) -> CodepointIter[src]
Returns the codepoints associated with this record.
impl FromStr for UnicodeData[src]
type Err = Error
The associated error which can be returned from parsing.
fn from_str(line: &str) -> Result<UnicodeData, Error>[src]
Parses a string s to return a value of this type. Read more