sws-crawler 0.1.0

Web crawler with pluggable scraping logic
Documentation

The main function, crawl_site, crawls and scrapes web pages. It is configured through a CrawlerConfig and a Scrapable implementation; the latter defines both the Seed used for crawling and the scraping logic. Note that robots.txt seeds are supported and exposed through texting_robots::Robot in the CrawlingContext and ScrapingContext.
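
To illustrate what "pluggable scraping logic" can look like, here is a minimal, self-contained sketch. It does not use the sws-crawler API: the Scraper trait, the crawl function, the TitleScraper type, and the simplified Seed and CrawlerConfig below are hypothetical stand-ins that only mirror the roles described above, and the "site" is an in-memory map rather than real HTTP. The actual signatures of crawl_site, Scrapable, and CrawlerConfig should be taken from the crate's API reference.

```rust
use std::collections::{HashMap, HashSet, VecDeque};

/// Hypothetical stand-in for a crawl seed: where crawling starts.
pub enum Seed {
    /// Start from an explicit list of page URLs.
    Pages(Vec<String>),
}

/// Hypothetical stand-in for crawler settings.
pub struct CrawlerConfig {
    /// Upper bound on the number of pages scraped.
    pub max_pages: usize,
}

/// Pluggable scraping logic: the crawl loop depends only on this trait.
/// (Assumed shape for illustration, not the crate's Scrapable trait.)
pub trait Scraper {
    type Output;

    /// Where the crawl begins.
    fn seed(&self) -> Seed;

    /// Extracts one record from a page and returns the links to follow next.
    fn scrape(&mut self, url: &str, body: &str) -> (Self::Output, Vec<String>);
}

/// Breadth-first crawl over an in-memory site (URL -> page body).
pub fn crawl<S: Scraper>(
    site: &HashMap<String, String>,
    config: &CrawlerConfig,
    scraper: &mut S,
) -> Vec<S::Output> {
    let Seed::Pages(start) = scraper.seed();
    let mut queue: VecDeque<String> = start.into();
    let mut seen: HashSet<String> = HashSet::new();
    let mut records = Vec::new();

    while let Some(url) = queue.pop_front() {
        if records.len() >= config.max_pages {
            break;
        }
        // Skip pages that were already visited.
        if !seen.insert(url.clone()) {
            continue;
        }
        // Unknown URLs are silently ignored in this sketch.
        if let Some(body) = site.get(&url) {
            let (record, links) = scraper.scrape(&url, body);
            records.push(record);
            queue.extend(links);
        }
    }
    records
}

/// Example scraper: the first line of a page is its title,
/// and every line of the form `link: <url>` is a link to follow.
pub struct TitleScraper;

impl Scraper for TitleScraper {
    type Output = String;

    fn seed(&self) -> Seed {
        Seed::Pages(vec!["/".to_string()])
    }

    fn scrape(&mut self, url: &str, body: &str) -> (String, Vec<String>) {
        let title = body.lines().next().unwrap_or("").to_string();
        let links = body
            .lines()
            .filter_map(|line| line.strip_prefix("link:"))
            .map(|target| target.trim().to_string())
            .collect();
        (format!("{url}: {title}"), links)
    }
}
```

The point of this design is that the crawl loop never needs to know what is being extracted: swapping TitleScraper for another Scraper implementation changes both the starting point and the extracted records without touching crawl.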