pub struct CanonicalizeConfig {
pub version: u32,
pub normalize_unicode: bool,
pub strip_punctuation: bool,
pub lowercase: bool,
}Expand description
Configuration for the canonical text pipeline.
version is a monotonically increasing schema version for the
canonical layer. Any behavior change that can affect canonical text,
tokenization, or canonical hashes must be accompanied by a new
configuration version.
Fields§
§version: u32Semantic version of the canonicalization configuration.
normalize_unicode: boolIf true, apply Unicode NFKC normalization before other transforms.
strip_punctuation: boolIf true, strip punctuation characters before tokenizing.
lowercase: boolIf true, lowercase the text.
Trait Implementations§
Source§impl Clone for CanonicalizeConfig
impl Clone for CanonicalizeConfig
Source§fn clone(&self) -> CanonicalizeConfig
fn clone(&self) -> CanonicalizeConfig
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for CanonicalizeConfig
impl Debug for CanonicalizeConfig
Source§impl Default for CanonicalizeConfig
impl Default for CanonicalizeConfig
Source§impl<'de> Deserialize<'de> for CanonicalizeConfig
impl<'de> Deserialize<'de> for CanonicalizeConfig
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Source§impl PartialEq for CanonicalizeConfig
impl PartialEq for CanonicalizeConfig
Source§impl Serialize for CanonicalizeConfig
impl Serialize for CanonicalizeConfig
impl Eq for CanonicalizeConfig
impl StructuralPartialEq for CanonicalizeConfig
Auto Trait Implementations§
impl Freeze for CanonicalizeConfig
impl RefUnwindSafe for CanonicalizeConfig
impl Send for CanonicalizeConfig
impl Sync for CanonicalizeConfig
impl Unpin for CanonicalizeConfig
impl UnwindSafe for CanonicalizeConfig
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more