Expand description
Trait for defining custom web spiders in the spider-lib framework.
This module provides the Spider trait, which serves as the blueprint
for creating custom web scrapers. A spider defines how a specific website
(or a group of websites) should be crawled and how data should be extracted.
Implementors of the Spider trait must:
- Specify the
Itemtype (the data structure for scraped data). - Provide a list of
start_urlsorstart_requeststo begin the crawl. - Implement the
parsemethod, which takes aResponseand returnsParseOutputcontaining newRequests to follow andScrapedItems.
Traitsยง
- Spider
- Defines the contract for a web spider.