kawat-xpath 0.1.1

XPath evaluation on HTML documents for kawat
Documentation

kawat-xpath

XPath evaluation engine for HTML documents in the kawat web content extraction library.

Provides a wrapper around sxd_html + sxd_xpath for evaluating XPath 1.0 expressions on HTML.

Features

  • XPath 1.0 support: Full XPath 1.0 expression evaluation on HTML
  • Pre-compiled expressions: Curated XPath expressions from trafilatura's xpaths.py
  • CSS selector fallback: Alternative CSS selector evaluation path
  • No unsafe code: Safe Rust implementation

License

Apache-2.0