pub struct ContentExtractionConfig {
    pub extract_text: bool,
    pub extract_metadata: bool,
    pub extract_images: bool,
    pub extract_tables: bool,
    pub extract_links: bool,
    pub max_content_length: usize,
    pub preserve_structure: bool,
    pub extract_page_info: bool,
    pub detect_language: bool,
    pub chunking_strategy: ChunkingStrategy,
    pub extract_multimedia_features: bool,
    pub generate_image_embeddings: bool,
    pub extract_audio_features: bool,
    pub extract_video_features: bool,
    pub max_image_resolution: Option<(u32, u32)>,
}
Content extraction configuration
Fields§
extract_text: bool
Extract text content
extract_metadata: bool
Extract metadata
extract_images: bool
Extract images
extract_tables: bool
Extract tables
extract_links: bool
Extract links
max_content_length: usize
Maximum content length to extract
preserve_structure: bool
Preserve document structure
extract_page_info: bool
Extract page/section information
detect_language: bool
Language detection
chunking_strategy: ChunkingStrategy
Content chunking strategy
extract_multimedia_features: bool
Extract multimedia features (image analysis, audio analysis, etc.)
generate_image_embeddings: bool
Generate image embeddings using computer vision models
extract_audio_features: bool
Extract audio features and generate embeddings
extract_video_features: bool
Extract video keyframes and generate embeddings
max_image_resolution: Option<(u32, u32)>
Maximum image processing resolution
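As a rough illustration of how the fields above combine, the following sketch builds a text-only configuration with struct update syntax on top of `Default`. It is self-contained, so the types here are local stand-ins: the `ChunkingStrategy::Placeholder` variant, the default values, and the `text_only_config` helper are all assumptions made for the example, not the crate's actual definitions.

```rust
// Local stand-ins so the sketch compiles on its own. The real types live in
// the documented crate; variants and default values below are assumptions.

/// Hypothetical placeholder: the real `ChunkingStrategy` variants are not
/// listed on this page.
#[derive(Clone, Debug, PartialEq)]
enum ChunkingStrategy {
    Placeholder,
}

/// Mirror of the documented struct, field for field.
#[allow(dead_code)]
#[derive(Clone, Debug)]
struct ContentExtractionConfig {
    extract_text: bool,
    extract_metadata: bool,
    extract_images: bool,
    extract_tables: bool,
    extract_links: bool,
    max_content_length: usize,
    preserve_structure: bool,
    extract_page_info: bool,
    detect_language: bool,
    chunking_strategy: ChunkingStrategy,
    extract_multimedia_features: bool,
    generate_image_embeddings: bool,
    extract_audio_features: bool,
    extract_video_features: bool,
    max_image_resolution: Option<(u32, u32)>,
}

impl Default for ContentExtractionConfig {
    fn default() -> Self {
        // Assumed defaults, chosen only for illustration.
        Self {
            extract_text: true,
            extract_metadata: true,
            extract_images: true,
            extract_tables: true,
            extract_links: true,
            max_content_length: 1_000_000,
            preserve_structure: true,
            extract_page_info: true,
            detect_language: true,
            chunking_strategy: ChunkingStrategy::Placeholder,
            extract_multimedia_features: false,
            generate_image_embeddings: false,
            extract_audio_features: false,
            extract_video_features: false,
            max_image_resolution: Some((1024, 1024)),
        }
    }
}

/// Hypothetical helper: overrides the image and multimedia switches and
/// takes every other field from `Default`.
fn text_only_config() -> ContentExtractionConfig {
    ContentExtractionConfig {
        extract_images: false,
        extract_multimedia_features: false,
        generate_image_embeddings: false,
        extract_audio_features: false,
        extract_video_features: false,
        max_image_resolution: None,
        max_content_length: 50_000,
        ..ContentExtractionConfig::default()
    }
}

fn main() {
    let config = text_only_config();
    println!("{config:#?}");
}
```

Because every field is public and the type implements `Default`, overriding a handful of fields this way avoids spelling out all 15 values at each call site.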
Trait Implementations§
impl Clone for ContentExtractionConfig
fn clone(&self) -> ContentExtractionConfig
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for ContentExtractionConfig
impl Default for ContentExtractionConfig
impl<'de> Deserialize<'de> for ContentExtractionConfig
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
Auto Trait Implementations§
impl Freeze for ContentExtractionConfig
impl RefUnwindSafe for ContentExtractionConfig
impl Send for ContentExtractionConfig
impl Sync for ContentExtractionConfig
impl Unpin for ContentExtractionConfig
impl UnwindSafe for ContentExtractionConfig
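Since the type is `Send` and `Sync`, one configuration can be shared by several worker threads behind an `Arc` without copying it. The sketch below uses a trimmed local stand-in with only two of the documented fields. The `truncate_all` helper and the reading of `max_content_length` as a character count are assumptions for the example, not behavior confirmed by this page.

```rust
use std::sync::Arc;
use std::thread;

/// Trimmed stand-in carrying two of the documented fields; the real struct
/// has 15.
#[derive(Clone, Debug)]
struct ContentExtractionConfig {
    extract_text: bool,
    max_content_length: usize,
}

/// Hypothetical worker pool: each document is processed on its own thread,
/// and all threads read the same shared configuration.
fn truncate_all(config: Arc<ContentExtractionConfig>, docs: Vec<String>) -> Vec<String> {
    let handles: Vec<_> = docs
        .into_iter()
        .map(|doc| {
            // Cloning the Arc only bumps a reference count.
            let config = Arc::clone(&config);
            thread::spawn(move || {
                if !config.extract_text {
                    return String::new();
                }
                // Assumption: the limit is counted in characters.
                doc.chars().take(config.max_content_length).collect::<String>()
            })
        })
        .collect();

    // Join in spawn order so the output order matches the input order.
    handles
        .into_iter()
        .map(|handle| handle.join().expect("worker thread panicked"))
        .collect()
}

fn main() {
    let config = Arc::new(ContentExtractionConfig {
        extract_text: true,
        max_content_length: 5,
    });
    let docs = vec!["hello world".to_string(), "hi".to_string()];
    let out = truncate_all(config, docs);
    println!("{out:?}");
}
```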
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
impl<T> Pointable for T
impl<T> PolicyExt for T
where
    T: ?Sized,
impl<R, P> ReadPrimitive<R> for P
fn read_from_little_endian(read: &mut R) -> Result<Self, Error>
Read this value from the supplied reader. Same as ReadEndian::read_from_little_endian().
impl<SS, SP> SupersetOf<SS> for SP
where
    SS: SubsetOf<SP>,
fn to_subset(&self) -> Option<SS>
The inverse inclusion map: attempts to construct self from the equivalent element of its superset.
fn is_in_subset(&self) -> bool
Checks if self is actually part of its subset T (and can be converted to it).
fn to_subset_unchecked(&self) -> SS
Use with care! Same as self.to_subset but without any property checks. Always succeeds.
fn from_subset(element: &SS) -> SP
The inclusion map: converts self to the equivalent element of its superset.