Skip to main content

Module sitemap_parser

Module sitemap_parser 

Source
Expand description

Sitemap Parser Module

Zero-allocation streaming parser for XML sitemaps. Supports gzip compression and sitemap index recursion.

§Examples

use rust_scraper::infrastructure::crawler::SitemapParser;

let parser = SitemapParser::new();
let urls = parser.parse_from_url("https://example.com/sitemap.xml").await?;
println!("Found {} URLs", urls.len());

§Errors

Returns SitemapError if:

  • URL is invalid
  • HTTP request fails
  • XML parsing fails
  • No <loc> elements found

Structs§

SitemapConfig
Sitemap parser configuration (builder pattern)
SitemapConfigBuilder
Builder for SitemapConfig
SitemapParser
Zero-allocation streaming sitemap parser

Enums§

SitemapError
Sitemap parser errors

Type Aliases§

Result
Result type for sitemap operations