pub struct TextExtractionConfig {
    pub chunk_size: usize,
    pub chunk_overlap: usize,
    pub clean_control_chars: bool,
    pub min_chunk_size: usize,
    pub cleaning: Option<CleaningConfig>,
}
Text extraction and chunking configuration
Fields
chunk_size: usize
Chunk size for text splitting
chunk_overlap: usize
Overlap between chunks
clean_control_chars: bool
Clean control characters
min_chunk_size: usize
Minimum chunk size to keep
cleaning: Option<CleaningConfig>
Text cleaning options
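The page does not say how these fields interact, so the following is only a minimal sketch of one plausible reading: a sliding window of `chunk_size` characters that advances by `chunk_size - chunk_overlap` and discards windows shorter than `min_chunk_size`. The struct here is a local mirror of the documented fields (without `cleaning`, since `CleaningConfig` is not shown), and `chunk_text` is a hypothetical helper, not part of the documented API. Measuring sizes in characters and keeping newlines and tabs when cleaning are both assumptions.

```rust
// Local mirror of the documented fields so the sketch compiles on its own.
// The `cleaning` field is left out because `CleaningConfig` is not shown here.
#[derive(Clone, Debug)]
struct TextExtractionConfig {
    chunk_size: usize,
    chunk_overlap: usize,
    clean_control_chars: bool,
    min_chunk_size: usize,
}

// Hypothetical helper (an assumption, not the crate's implementation):
// splits `text` into overlapping windows measured in characters.
fn chunk_text(text: &str, cfg: &TextExtractionConfig) -> Vec<String> {
    // Optionally drop control characters, keeping newlines and tabs
    // (the exact cleaning policy is an assumption).
    let chars: Vec<char> = text
        .chars()
        .filter(|c| !cfg.clean_control_chars || !c.is_control() || *c == '\n' || *c == '\t')
        .collect();

    // A zero chunk size cannot produce any chunk.
    if cfg.chunk_size == 0 {
        return Vec::new();
    }

    // Advance by at least one character so the loop always terminates,
    // even if the overlap is as large as the chunk size.
    let step = cfg.chunk_size.saturating_sub(cfg.chunk_overlap).max(1);

    let mut chunks = Vec::new();
    let mut start = 0;
    while start < chars.len() {
        let end = (start + cfg.chunk_size).min(chars.len());
        // Keep only chunks that meet the minimum size.
        if end - start >= cfg.min_chunk_size {
            chunks.push(chars[start..end].iter().collect());
        }
        if end == chars.len() {
            break;
        }
        start += step;
    }
    chunks
}
```

Under these assumptions, a 26-character input with `chunk_size: 10` and `chunk_overlap: 3` yields windows starting at offsets 0, 7, 14 and 21, and the final five-character window is kept only if `min_chunk_size` is 5 or less.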
Trait Implementations
impl Clone for TextExtractionConfig
fn clone(&self) -> TextExtractionConfig
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self) (since 1.0.0)
Performs copy-assignment from source.
impl Debug for TextExtractionConfig
impl Default for TextExtractionConfig
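Because the type implements both `Default` and `Clone`, a common way to build a configuration is to override a few fields and take the rest from the defaults using struct update syntax. The sketch below uses a local mirror of the struct (without `cleaning`) so it compiles on its own. The default values in it are placeholders invented for illustration, since this page does not list the real ones, and `small_chunks` is a hypothetical helper.

```rust
// Local mirror of the documented fields; `cleaning` is omitted because
// `CleaningConfig` is not shown on this page.
#[derive(Clone, Debug, PartialEq)]
struct TextExtractionConfig {
    chunk_size: usize,
    chunk_overlap: usize,
    clean_control_chars: bool,
    min_chunk_size: usize,
}

impl Default for TextExtractionConfig {
    fn default() -> Self {
        // Placeholder values for this sketch only; the crate's actual
        // defaults are not documented here.
        Self {
            chunk_size: 1000,
            chunk_overlap: 200,
            clean_control_chars: true,
            min_chunk_size: 50,
        }
    }
}

// Hypothetical helper: override the chunking sizes and inherit
// every other field from `Default`.
fn small_chunks() -> TextExtractionConfig {
    TextExtractionConfig {
        chunk_size: 256,
        chunk_overlap: 32,
        ..Default::default()
    }
}
```

With the real type, the same pattern would also need to account for the `cleaning` field, which `..Default::default()` would fill in automatically.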
impl<'de> Deserialize<'de> for TextExtractionConfig
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
Auto Trait Implementations
impl Freeze for TextExtractionConfig
impl RefUnwindSafe for TextExtractionConfig
impl Send for TextExtractionConfig
impl Sync for TextExtractionConfig
impl Unpin for TextExtractionConfig
impl UnsafeUnpin for TextExtractionConfig
impl UnwindSafe for TextExtractionConfig
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.