[−][src]Struct punkt::Trainer
A trainer will build data about abbreviations, sentence starters, collocations, and context that tokens appear in. The data is used by the sentence tokenizer to determine if a period is likely part of an abbreviation, or actually marks the termination of a sentence.
Methods
impl<P> Trainer<P> where
P: TrainerParameters + DefinesNonPrefixCharacters + DefinesNonWordCharacters,
[src]
P: TrainerParameters + DefinesNonPrefixCharacters + DefinesNonWordCharacters,
pub fn new() -> Trainer<P>
[src]
Creates a new Trainer.
pub fn train(&self, doc: &str, data: &mut TrainingData)
[src]
Train on a document. Does tokenization using a WordTokenizer.
Auto Trait Implementations
Blanket Implementations
impl<T> From for T
[src]
impl<T, U> Into for T where
U: From<T>,
[src]
U: From<T>,
impl<T, U> TryFrom for T where
U: Into<T>,
[src]
U: Into<T>,
type Error = !
🔬 This is a nightly-only experimental API. (
try_from
)The type returned in the event of a conversion error.
fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>
[src]
impl<T> Borrow for T where
T: ?Sized,
[src]
T: ?Sized,
impl<T> Any for T where
T: 'static + ?Sized,
[src]
T: 'static + ?Sized,
impl<T> BorrowMut for T where
T: ?Sized,
[src]
T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
[src]
impl<T, U> TryInto for T where
U: TryFrom<T>,
[src]
U: TryFrom<T>,