Module icu_segmenter::provider
source · Expand description
🚧 [Unstable] Data provider struct definitions for this ICU4X component.
🚧 This code is considered unstable; it may change at any time, in breaking or non-breaking ways,
including in SemVer minor releases. While the serde representation of data structs is guaranteed
to be stable, their Rust representation might not be. Use with caution.
Read more about data providers: icu_provider
Structs
- Marker type for
UCharDictionaryBreakDataV1
: “segmenter/dictionary/wl_ext@1” - Marker type for
UCharDictionaryBreakDataV1
: “segmenter/dictionary/w_auto@1” - Marker type for
RuleBreakDataV1
: “segmenter/grapheme@1” - Marker type for
RuleBreakDataV1
: “segmenter/line@1” - The struct that stores a LSTM model.
- Marker type for
LstmDataV1
: “segmenter/lstm/wl_auto@1” - The struct that stores a LSTM’s matrix.
- The struct that stores a LSTM’s matrix.
- The struct that stores a LSTM’s matrix.
- Pre-processed Unicode data in the form of tables to be used for rule-based breaking.
- Property table for rule-based breaking.
- Break state table for rule-based breaking.
- Rules status data for rule_status and is_word_like of word segmenter.
- Marker type for
RuleBreakDataV1
: “segmenter/sentence@1” - char16trie data for dictionary break
- Marker type for
RuleBreakDataV1
: “segmenter/word@1”
Enums
- The data to power the LSTM segmentation model.
- The type of LSTM model
- The category tag that is returned by
WordBreakIterator::word_type()
.