Function extract_from_patterns 

pub fn extract_from_patterns(html: &str, base_url: &str) -> PatternResult

Extract data from raw HTML using CSS selectors and regex patterns.

This is the main entry point for the pattern engine. It parses the HTML document once and then runs each extraction pass (price, rating, availability, page type, actions) in sequence, stopping each pass at the first confident match.
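The "first confident match" behaviour of a single pass can be sketched roughly as follows. This is an illustration only, not the crate's implementation: `first_match`, `price_from_attr`, `price_from_dollar`, and `extract_price` are hypothetical names, and plain string searches stand in for the CSS selectors and regex patterns the real engine uses.

```rust
/// Hypothetical stand-in for one extraction pass: try each candidate
/// extractor in priority order and keep the first one that yields a value.
fn first_match<T>(html: &str, extractors: &[fn(&str) -> Option<T>]) -> Option<T> {
    // `find_map` stops at the first extractor that returns `Some`.
    extractors.iter().find_map(|extract| extract(html))
}

/// Illustrative extractor (assumption): reads a `data-price="..."` attribute.
fn price_from_attr(html: &str) -> Option<String> {
    let marker = "data-price=\"";
    let start = html.find(marker)? + marker.len();
    let end = html[start..].find('"')? + start;
    Some(html[start..end].to_string())
}

/// Illustrative fallback (assumption): takes the digits and dots that
/// follow the first `$` sign in the document.
fn price_from_dollar(html: &str) -> Option<String> {
    let start = html.find('$')? + 1;
    let value: String = html[start..]
        .chars()
        .take_while(|c| c.is_ascii_digit() || *c == '.')
        .collect();
    if value.is_empty() {
        None
    } else {
        Some(value)
    }
}

/// A price pass built from the two extractors above, most specific first.
fn extract_price(html: &str) -> Option<String> {
    let passes: [fn(&str) -> Option<String>; 2] = [price_from_attr, price_from_dollar];
    first_match(html, &passes)
}
```

Ordering the extractors from most to least specific means a structured attribute wins over a looser text match whenever both are present.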

§Arguments

  • html - Raw HTML string to parse.
  • base_url - Base URL for resolving relative form action URLs.
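To illustrate why base_url is needed, the sketch below resolves a form's action attribute against it. `resolve_action` is a hypothetical helper written for this example; how the crate actually resolves URLs is not shown on this page. The sketch is deliberately simplified: it handles absolute URLs, root-relative paths, and plain relative paths only, and does not implement full RFC 3986 resolution.

```rust
/// Simplified, hypothetical resolver for a form `action` attribute.
/// Not RFC 3986 compliant: no `..` segments, query, fragment, or
/// empty-action handling.
fn resolve_action(base_url: &str, action: &str) -> String {
    // Already absolute: return unchanged.
    if action.starts_with("http://") || action.starts_with("https://") {
        return action.to_string();
    }

    // Split the base into origin ("https://host") and path ("/a/b.html").
    let scheme_end = base_url.find("://").map(|i| i + 3).unwrap_or(0);
    let path_start = base_url[scheme_end..]
        .find('/')
        .map(|i| i + scheme_end)
        .unwrap_or(base_url.len());
    let (origin, path) = base_url.split_at(path_start);

    if action.starts_with('/') {
        // Root-relative: replace the whole path.
        format!("{}{}", origin, action)
    } else {
        // Relative: keep the base path up to and including its last '/'.
        let dir = match path.rfind('/') {
            Some(i) => &path[..=i],
            None => "/",
        };
        format!("{}{}{}", origin, dir, action)
    }
}
```

With a base of `https://shop.example/products/item.html`, an action of `/cart/add` resolves to `https://shop.example/cart/add`, and an action of `buy` resolves to `https://shop.example/products/buy`.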

§Returns

A PatternResult with all discovered data. Fields that could not be extracted are None.
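Because missing data is reported as None, callers need to handle both cases for every field. The sketch below shows one way to do that. The fields of PatternResult are not listed on this page, so `PatternResultSketch` and its `price` and `rating` fields are assumptions made purely for illustration, as is the `summarize` helper.

```rust
/// Hypothetical stand-in; the real `PatternResult` fields are not shown here.
#[derive(Debug, Default)]
struct PatternResultSketch {
    price: Option<f64>,
    rating: Option<f64>,
}

/// Renders a one-line summary, substituting "unknown" for missing fields.
fn summarize(result: &PatternResultSketch) -> String {
    // Explicit `match` on an optional field.
    let price = match result.price {
        Some(p) => format!("{:.2}", p),
        None => "unknown".to_string(),
    };
    // The same idea written with a combinator.
    let rating = result
        .rating
        .map_or("unknown".to_string(), |r| format!("{:.1}", r));
    format!("price={}, rating={}", price, rating)
}
```

Treating every field as optional keeps a partially matched page usable: the caller reads whatever was found and falls back for the rest.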