pub struct ExtractOptions<'a> {Show 23 fields
pub raw_html: &'a str,
pub source_url: &'a str,
pub status_code: u16,
pub rendered_with: Option<String>,
pub elapsed_ms: u64,
pub render_decision: Option<RenderDecision>,
pub credit_cost: u32,
pub warnings: Vec<String>,
pub formats: &'a [OutputFormat],
pub only_main_content: bool,
pub include_tags: &'a [String],
pub exclude_tags: &'a [String],
pub css_selector: Option<&'a str>,
pub xpath: Option<&'a str>,
pub chunk_strategy: Option<&'a ChunkStrategy>,
pub query: Option<&'a str>,
pub filter_mode: Option<&'a FilterMode>,
pub top_k: Option<usize>,
pub domain_selectors: Option<&'a HashMap<String, String>>,
pub captured_responses: &'a [CapturedNetworkResponse],
pub llm_fallback: Option<LlmFallbackParams<'a>>,
pub debug: bool,
pub debug_sink: Option<Arc<Mutex<DebugCollector>>>,
}Expand description
Options for the high-level extraction pipeline.
Fields§
§raw_html: &'a str§source_url: &'a str§status_code: u16§rendered_with: Option<String>§elapsed_ms: u64§render_decision: Option<RenderDecision>Routing decision metadata to surface to API consumers.
credit_cost: u32Credit cost attributed to this fetch.
warnings: Vec<String>Soft-failure warnings collected through the render chain.
formats: &'a [OutputFormat]§only_main_content: bool§css_selector: Option<&'a str>CSS selector to narrow content before readability extraction.
xpath: Option<&'a str>XPath expression to narrow content before readability extraction.
chunk_strategy: Option<&'a ChunkStrategy>Strategy for chunking the extracted markdown.
query: Option<&'a str>Query for chunk filtering (requires filter_mode).
filter_mode: Option<&'a FilterMode>Filtering algorithm for chunk ranking.
top_k: Option<usize>Number of top chunks to return (default: 5).
domain_selectors: Option<&'a HashMap<String, String>>Per-host CSS selector overrides. Used only when the request did not
supply an explicit css_selector / xpath. The selector for the
source URL’s host is applied before readability narrowing.
captured_responses: &'a [CapturedNetworkResponse]XHR/fetch responses captured during navigation. Used as a fallback content source when DOM-based extraction is low quality.
llm_fallback: Option<LlmFallbackParams<'a>>LLM-assisted extraction fallback configuration. When the chosen
candidate’s quality score is below quality_threshold and enable
is true, the raw HTML (truncated to max_html_bytes) is sent to the
configured LLM provider for re-extraction.
debug: boolOpt-in extraction debug trace. When true, the extractor populates
debug_sink with one DebugAttempt per extract() invocation.
debug_sink: Option<Arc<Mutex<DebugCollector>>>Sink for debug attempts. Shared across the multi-attempt JS-escalation loop so that all attempts land in one trace.
Auto Trait Implementations§
impl<'a> Freeze for ExtractOptions<'a>
impl<'a> RefUnwindSafe for ExtractOptions<'a>
impl<'a> Send for ExtractOptions<'a>
impl<'a> Sync for ExtractOptions<'a>
impl<'a> Unpin for ExtractOptions<'a>
impl<'a> UnsafeUnpin for ExtractOptions<'a>
impl<'a> UnwindSafe for ExtractOptions<'a>
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more