XPath evaluation on HTML documents.
Provides a wrapper around sxd_html and sxd_xpath for evaluating
XPath 1.0 expressions on HTML, with scraper as a CSS-selector fallback.
All XPath expressions from trafilatura’s xpaths.py use a narrow subset:
contains(), starts-with(), translate(), self::, [1] positional predicates,
or/and, and attribute tests. Apart from self:: and attribute tests, no axes
beyond descendant and child are used.
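XPath 1.0 has no lower-case() function, so translate() is the usual way to write case-insensitive tests inside contains() or starts-with(). The following is a minimal sketch of the translate() semantics defined by the XPath 1.0 specification, written in plain Rust for illustration. It is not this module's code or sxd_xpath's implementation, and the names `xpath_translate` and `contains_ci` are hypothetical.

```rust
/// Sketch of XPath 1.0 `translate(s, from, to)` semantics.
/// Hypothetical helper for illustration; not part of this crate or sxd_xpath.
fn xpath_translate(s: &str, from: &str, to: &str) -> String {
    let from_chars: Vec<char> = from.chars().collect();
    let to_chars: Vec<char> = to.chars().collect();
    s.chars()
        .filter_map(|c| match from_chars.iter().position(|&f| f == c) {
            // Character is not listed in `from`: keep it unchanged.
            None => Some(c),
            // Character is listed: use the character at the same index in `to`.
            // If `to` is shorter than `from`, the character is dropped.
            // `position` returns the first match, so the first occurrence wins.
            Some(i) => to_chars.get(i).copied(),
        })
        .collect()
}

/// ASCII case-insensitive `contains`, mirroring the XPath idiom
/// contains(translate(@class, "ABC...", "abc..."), "needle").
/// `needle_lower` is assumed to be lowercase already.
fn contains_ci(haystack: &str, needle_lower: &str) -> bool {
    const UPPER: &str = "ABCDEFGHIJKLMNOPQRSTUVWXYZ";
    const LOWER: &str = "abcdefghijklmnopqrstuvwxyz";
    xpath_translate(haystack, UPPER, LOWER).contains(needle_lower)
}
```

For example, `xpath_translate("Post-Body", "PB", "pb")` yields `post-body`, and passing an empty `to` string deletes every character listed in `from`.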
Structs
- CompiledXpaths - All pre-compiled XPath expression groups used in extraction.
- XpathEngine - Evaluates XPath expressions against HTML documents.
- XpathExpr - Pre-compiled XPath expression group (thread-safe, reusable).