pub struct JaKoTokenizer { /* private fields */ }
Splits phrases using Japanese/Korean word segmentation.
Each token emitted by this tokenizer also carries a JaKoTokenMetadata in its Token::metadata.
Internally, it uses lindera::tokenizer::Tokenizer.
Implementations
impl JaKoTokenizer
pub fn from_dictionary(dictionary: Dictionary, user_dictionary: Option<UserDictionary>) -> Self
Creates a new Japanese/Korean tokenizer using the given lindera::dictionary::Dictionary and default segmenter settings.
pub fn from_segmenter(segmenter: Segmenter) -> Self
Creates a new Japanese/Korean tokenizer using the given lindera::segmenter::Segmenter.
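The relationship between the two constructors can be sketched with stand-in types. This is only an illustration under stated assumptions: `SketchTokenizer` and the local `Dictionary`, `UserDictionary`, `Segmenter`, and `Mode` types below are hypothetical placeholders, not the lindera types, and the idea that the default settings correspond to a `Normal` mode is an assumption this page does not confirm.

```rust
// Hypothetical stand-ins for the lindera types; the real ones carry
// dictionary data and are constructed through lindera's own API.
#[derive(Debug, Clone, PartialEq)]
pub struct Dictionary(pub String);

#[derive(Debug, Clone, PartialEq)]
pub struct UserDictionary(pub String);

#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Mode {
    Normal,
    Decompose,
}

#[derive(Debug, Clone, PartialEq)]
pub struct Segmenter {
    pub mode: Mode,
    pub dictionary: Dictionary,
    pub user_dictionary: Option<UserDictionary>,
}

// Hypothetical tokenizer that mirrors the two constructors documented above.
pub struct SketchTokenizer {
    pub segmenter: Segmenter,
}

impl SketchTokenizer {
    // Convenience path: build a segmenter with default settings
    // (assumed here to be `Mode::Normal`) and delegate.
    pub fn from_dictionary(
        dictionary: Dictionary,
        user_dictionary: Option<UserDictionary>,
    ) -> Self {
        Self::from_segmenter(Segmenter {
            mode: Mode::Normal,
            dictionary,
            user_dictionary,
        })
    }

    // Full-control path: the caller has already configured the segmenter.
    pub fn from_segmenter(segmenter: Segmenter) -> Self {
        Self { segmenter }
    }
}
```

Under these assumptions, from_dictionary is the shorter route when the defaults are acceptable, while from_segmenter is the one to reach for when the segmenter needs non-default configuration.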
Trait Implementations
Auto Trait Implementations
impl Freeze for JaKoTokenizer
impl RefUnwindSafe for JaKoTokenizer
impl Send for JaKoTokenizer
impl Sync for JaKoTokenizer
impl Unpin for JaKoTokenizer
impl UnwindSafe for JaKoTokenizer
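Because JaKoTokenizer is both Send and Sync, a single instance can in principle be shared across threads behind an Arc. The sketch below shows that pattern with a hypothetical `StandInTokenizer` that only splits on whitespace; it does not perform Japanese/Korean segmentation, and the `tokenize` method name and the `tokenize_in_parallel` helper are assumptions made for illustration.

```rust
use std::sync::Arc;
use std::thread;

// Hypothetical stand-in for a tokenizer; it splits on whitespace only.
pub struct StandInTokenizer;

impl StandInTokenizer {
    pub fn tokenize(&self, text: &str) -> Vec<String> {
        text.split_whitespace().map(str::to_owned).collect()
    }
}

// Compiles only if `T` is both `Send` and `Sync`.
fn assert_send_sync<T: Send + Sync>() {}

// Tokenizes each input on its own thread, sharing one tokenizer.
// Results are returned in the same order as the inputs.
pub fn tokenize_in_parallel(
    tokenizer: Arc<StandInTokenizer>,
    inputs: Vec<String>,
) -> Vec<Vec<String>> {
    assert_send_sync::<StandInTokenizer>();

    let handles: Vec<thread::JoinHandle<Vec<String>>> = inputs
        .into_iter()
        .map(|input| {
            // Each worker gets its own reference-counted handle.
            let tokenizer = Arc::clone(&tokenizer);
            thread::spawn(move || tokenizer.tokenize(&input))
        })
        .collect();

    handles
        .into_iter()
        .map(|handle| handle.join().expect("worker thread panicked"))
        .collect()
}
```

The compile-time check in `assert_send_sync` is a common way to confirm that a type carries the auto traits listed above before relying on them.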
Blanket Implementations
impl<T> BorrowMut<T> for T where T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
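This blanket implementation comes from the standard library and applies to every type, not just this one. As a minimal sketch of what it enables, the hypothetical helper `append_marker` below accepts either an owned `Vec<String>` or a `&mut Vec<String>`, since both implement `BorrowMut<Vec<String>>`.

```rust
use std::borrow::BorrowMut;

// Hypothetical helper: works with an owned buffer or a mutable reference to one.
pub fn append_marker<B: BorrowMut<Vec<String>>>(mut buffer: B) -> B {
    {
        // Obtain `&mut Vec<String>` regardless of how the buffer was passed.
        let inner: &mut Vec<String> = buffer.borrow_mut();
        inner.push("end".to_string());
    }
    buffer
}
```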