pub struct SimdTokenizer { /* private fields */ }Expand description
SIMD-optimized tokenization utilities for improved performance
Implementations§
Source§impl SimdTokenizer
impl SimdTokenizer
Sourcepub fn classify_ascii_chars(&self, text: &[u8]) -> Vec<u8> ⓘ
pub fn classify_ascii_chars(&self, text: &[u8]) -> Vec<u8> ⓘ
Fast ASCII character classification using SIMD
Sourcepub fn find_whitespace_boundaries(&self, text: &[u8]) -> Vec<usize>
pub fn find_whitespace_boundaries(&self, text: &[u8]) -> Vec<usize>
Fast whitespace detection using SIMD
Sourcepub fn validate_utf8_fast(&self, bytes: &[u8]) -> Result<()>
pub fn validate_utf8_fast(&self, bytes: &[u8]) -> Result<()>
Fast byte-to-UTF8 validation using SIMD
Trait Implementations§
Auto Trait Implementations§
impl Freeze for SimdTokenizer
impl RefUnwindSafe for SimdTokenizer
impl Send for SimdTokenizer
impl Sync for SimdTokenizer
impl Unpin for SimdTokenizer
impl UnsafeUnpin for SimdTokenizer
impl UnwindSafe for SimdTokenizer
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more