pub struct AdvancedExtractor;
Advanced content extractor with article extraction and language detection
Implementations§
impl AdvancedExtractor
pub fn extract_article(html: &str, url: &str) -> Result<ArticleContent>
Extract article content using Mozilla’s Readability algorithm
pub fn generate_excerpt(text: &str) -> Option<String>
Generate a smart excerpt with sentence boundary detection
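The source does not say how generate_excerpt finds sentence boundaries or what length limit it applies. As a rough illustration only, the sketch below shows one way a sentence-aware excerpt could work. The name excerpt_sketch and the max_chars parameter are hypothetical and not part of AdvancedExtractor, and the terminator check is deliberately naive (it would also stop at the dot in "3.14" or "e.g.").

```rust
/// Hypothetical sketch, not the crate's implementation.
/// Returns the longest prefix of `text` that fits in `max_chars` characters
/// and ends on a sentence terminator; falls back to a word boundary.
fn excerpt_sketch(text: &str, max_chars: usize) -> Option<String> {
    let text = text.trim();
    if text.is_empty() {
        return None;
    }
    // Short text is returned whole.
    if text.chars().count() <= max_chars {
        return Some(text.to_string());
    }
    // Byte offset just past the first `max_chars` characters.
    let limit = text
        .char_indices()
        .nth(max_chars)
        .map(|(i, _)| i)
        .unwrap_or(text.len());
    let window = &text[..limit];
    // Prefer the last sentence terminator inside the window.
    if let Some(end) = window.rfind(|c: char| matches!(c, '.' | '!' | '?')) {
        return Some(window[..=end].to_string());
    }
    // Otherwise cut at the last word boundary and mark the truncation.
    let cut = window.rfind(char::is_whitespace).unwrap_or(limit);
    Some(format!("{}...", window[..cut].trim_end()))
}
```

Returning None for empty input mirrors the Option<String> return type in the signature above; whether the real method treats empty text this way is an assumption.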
pub fn detect_language(text: &str) -> Option<String>
Detect language using whatlang
pub fn extract_tables_as_json(html: &str) -> Result<Vec<TableData>>
Extract tables as structured JSON with header mapping
pub fn count_words(text: &str) -> usize
Count words in text
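The page does not define what counts as a word. A common convention, assumed here, is to split on Unicode whitespace; the function name count_words_sketch is hypothetical and the real method may tokenize differently (for example around punctuation or CJK text).

```rust
/// Hypothetical sketch: counts whitespace-separated tokens.
/// Runs of spaces, tabs, and newlines all act as a single separator.
fn count_words_sketch(text: &str) -> usize {
    text.split_whitespace().count()
}
```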
pub fn estimate_reading_time(word_count: usize) -> usize
Estimate reading time in minutes (assuming 200 words per minute)
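The only detail the page gives is the rate of 200 words per minute. The sketch below assumes the result is rounded up to whole minutes, so any non-empty text takes at least one minute; the rounding rule and the name reading_time_sketch are assumptions, not documented behavior.

```rust
/// Reading rate stated in the method description above.
const WORDS_PER_MINUTE: usize = 200;

/// Hypothetical sketch: minutes needed at 200 words per minute,
/// rounded up (assumed). Zero words yields zero minutes.
fn reading_time_sketch(word_count: usize) -> usize {
    // Ceiling division without floating point.
    (word_count + WORDS_PER_MINUTE - 1) / WORDS_PER_MINUTE
}
```

Under this assumption a 1,000-word article comes out at 5 minutes and a 201-word note at 2. If the real method rounds down or to nearest, short texts would report 0 or 1 instead.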
Auto Trait Implementations§
impl Freeze for AdvancedExtractor
impl RefUnwindSafe for AdvancedExtractor
impl Send for AdvancedExtractor
impl Sync for AdvancedExtractor
impl Unpin for AdvancedExtractor
impl UnsafeUnpin for AdvancedExtractor
impl UnwindSafe for AdvancedExtractor
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.