Struct notmecab::LexerToken

pub struct LexerToken {
    pub cost: i64,
    pub real_cost: i64,
    pub range: Range<usize>,
    pub kind: TokenType,
    pub original_id: u32,
    pub feature_offset: u32,
    // some fields omitted
}

Fields

cost: i64

Used internally during lattice pathfinding.

real_cost: i64

Cost updated to include right-edge connection cost after parsing.

range: Range<usize>

The byte range in the original text that this token corresponds to.
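Because the range counts bytes rather than characters, it cannot be used directly as a character index on multi-byte text such as Japanese. The following is a minimal sketch of converting a byte range to a character range. The helper name `byte_range_to_char_range` and the hand-written range are illustrative assumptions, not part of notmecab.

```rust
use std::ops::Range;

/// Converts a byte range into a range of `char` indices for the same text.
/// Assumes both ends of `bytes` fall on UTF-8 character boundaries;
/// slicing panics otherwise.
fn byte_range_to_char_range(text: &str, bytes: &Range<usize>) -> Range<usize> {
    // Number of characters before the token starts.
    let start = text[..bytes.start].chars().count();
    // Number of characters inside the token itself.
    let len = text[bytes.start..bytes.end].chars().count();
    start..start + len
}

fn main() {
    let text = "これを持っていけ";
    // Hypothetical token range: each kana here is 3 bytes in UTF-8,
    // so bytes 6..9 cover the third character.
    let range = 6..9;
    let chars = byte_range_to_char_range(text, &range);
    println!("bytes {:?} -> chars {:?}", range, chars);
}
```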

kind: TokenType

Origin of the token. BOS and UNK are virtual origins ("beginning/ending-of-string" and "unknown", respectively). Normal means the token came from the mecab dictionary.

The BOS (beginning/ending-of-string) tokens are stripped away in parse_to_lexertokens.
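One use of the token origin is spotting text the dictionary did not cover. The sketch below uses a hypothetical stand-in enum, `KindSketch`, limited to the three origins named above; the real `TokenType` may have other variants and different trait implementations.

```rust
/// Hypothetical stand-in for `TokenType`, limited to the origins
/// named in the documentation above.
#[derive(Debug, Clone, Copy, PartialEq)]
enum KindSketch {
    Normal,
    Unk,
    Bos,
}

/// Counts tokens whose origin is "unknown", i.e. not found in the dictionary.
fn count_unknown(kinds: &[KindSketch]) -> usize {
    kinds.iter().filter(|k| **k == KindSketch::Unk).count()
}

fn main() {
    // Hand-written origins standing in for a tokenizer's output.
    let kinds = [
        KindSketch::Bos,
        KindSketch::Normal,
        KindSketch::Unk,
        KindSketch::Normal,
        KindSketch::Bos,
    ];
    println!("unknown tokens: {}", count_unknown(&kinds));
}
```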

original_id: u32

Unique identifier of the specific lexeme realization this token represents, taken from the mecab dictionary. It changes between dictionary versions.

feature_offset: u32

Implementations

impl LexerToken

pub fn get_text<'a>(&self, whole_text: &'a str) -> &'a str

Returns the part of the original text that this token corresponds to.

whole_text must be the same string that you passed to Dict::tokenize or Dict::tokenize_with_cache.
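Conceptually, this amounts to slicing the original string by the token's byte range. The sketch below illustrates that idea with a hypothetical stand-in type, `TokenSketch`, and hand-written ranges; it is not the crate's actual implementation and does not call the tokenizer.

```rust
use std::ops::Range;

/// Hypothetical stand-in for `LexerToken`, keeping only the `range` field.
struct TokenSketch {
    range: Range<usize>,
}

impl TokenSketch {
    /// Slices the original text by this token's byte range.
    /// Panics if the range is out of bounds or not on character boundaries,
    /// which can happen when a different string is passed in.
    fn get_text<'a>(&self, whole_text: &'a str) -> &'a str {
        &whole_text[self.range.clone()]
    }
}

fn main() {
    let whole_text = "これを持っていけ";
    // Hand-written byte ranges standing in for tokenizer output.
    let tokens = vec![
        TokenSketch { range: 0..6 },
        TokenSketch { range: 6..9 },
        TokenSketch { range: 9..24 },
    ];
    for token in &tokens {
        println!("{}", token.get_text(whole_text));
    }
}
```

Passing a string other than the one that was tokenized yields unrelated text or a panic, since the ranges are only meaningful for the original input.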

pub fn get_feature<'a>(&self, dict: &'a Dict) -> &'a str

Returns a feature string corresponding to this token.

Feature strings are dictionary-specific, so unfortunately you need to parse them yourself. They usually contain things like the exact part of speech this token represents, its reading, whether it's conjugated or not, etc.
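As a rough illustration of parsing a feature string yourself, the sketch below splits on commas and reads fields by position. The sample string and the field positions (part of speech at index 0, reading at index 7) assume an IPAdic-style layout, and the helper names `split_feature` and `field` are hypothetical. Other dictionaries order their fields differently, and some quote fields that contain commas, which this naive split does not handle.

```rust
/// Splits a feature string on commas.
/// Assumption: no field contains a quoted comma.
fn split_feature(feature: &str) -> Vec<&str> {
    feature.split(',').collect()
}

/// Returns the field at `index`, treating a missing field or a "*"
/// placeholder as absent.
fn field<'a>(fields: &[&'a str], index: usize) -> Option<&'a str> {
    fields.get(index).copied().filter(|f| *f != "*")
}

fn main() {
    // Sample feature string in an assumed IPAdic-style layout.
    let feature = "名詞,代名詞,一般,*,*,*,これ,コレ,コレ";
    let fields = split_feature(feature);
    println!("part of speech: {:?}", field(&fields, 0));
    println!("reading: {:?}", field(&fields, 7));
}
```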

Trait Implementations

impl Clone for LexerToken

impl Debug for LexerToken

Auto Trait Implementations

Blanket Implementations

impl<T> Any for T where
    T: 'static + ?Sized

impl<T> Borrow<T> for T where
    T: ?Sized

impl<T> BorrowMut<T> for T where
    T: ?Sized

impl<T> From<T> for T

impl<T, U> Into<U> for T where
    U: From<T>, 

impl<T> ToOwned for T where
    T: Clone

type Owned = T

The resulting type after obtaining ownership.

impl<T, U> TryFrom<U> for T where
    U: Into<T>, 

type Error = Infallible

The type returned in the event of a conversion error.

impl<T, U> TryInto<U> for T where
    U: TryFrom<T>, 

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.