scrapling-spider 0.2.0

Concurrent web crawler framework for scrapling

scrapling-spider

Concurrent web crawler framework for scrapling-rs.

Features

  • Concurrent crawling with configurable parallelism
  • Request deduplication via SHA-1 fingerprinting
  • Robots.txt compliance with crawl-delay support
  • Checkpoint/resume for long-running crawls
  • Development mode with response caching
  • Built on scrapling-fetch for TLS impersonation and scrapling-browser for JS-rendered pages
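To illustrate the request-deduplication idea from the list above, here is a minimal sketch using only the Rust standard library. It is not the scrapling-spider API: the `Request` struct, the `fingerprint` function, and the `SeenSet` type are hypothetical names made up for this example. The crate is described as using SHA-1; this sketch swaps in the standard library's `DefaultHasher` as a stand-in so that it runs without extra dependencies. The normalization shown (upper-casing the method, dropping the URL fragment) is likewise an assumption about what a fingerprint might cover.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashSet;
use std::hash::{Hash, Hasher};

/// Hypothetical request description; not a scrapling-spider type.
#[derive(Debug, Clone)]
pub struct Request {
    pub method: String,
    pub url: String,
    pub body: Vec<u8>,
}

/// Computes a fingerprint over the parts of a request that identify it.
/// Stand-in hash: std's DefaultHasher (64-bit), NOT SHA-1.
pub fn fingerprint(req: &Request) -> u64 {
    let mut hasher = DefaultHasher::new();
    // Treat "get" and "GET" as the same method.
    req.method.to_ascii_uppercase().hash(&mut hasher);
    // Drop the fragment: it is never sent to the server.
    let url = req.url.split('#').next().unwrap_or("");
    url.hash(&mut hasher);
    req.body.hash(&mut hasher);
    hasher.finish()
}

/// Remembers the fingerprints of requests that were already scheduled.
#[derive(Default)]
pub struct SeenSet {
    seen: HashSet<u64>,
}

impl SeenSet {
    /// Returns true if the request is new, false if it is a duplicate.
    pub fn insert(&mut self, req: &Request) -> bool {
        self.seen.insert(fingerprint(req))
    }
}
```

A crawler loop would call `SeenSet::insert` before enqueueing each discovered request and skip any request for which it returns `false`. With a 64-bit stand-in hash, collisions are more likely than with a 160-bit SHA-1 digest, which is one reason a real implementation would prefer the longer fingerprint.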

License

MIT