Expand description
Data structures for scraped items and spider output in spider-lib.
This module defines the ScrapedItem trait, which is the core abstraction
for any data extracted by a web spider. Implementors of this trait define
the shape of the data they wish to collect.
Additionally, the ParseOutput struct is provided as the standard return type
for a spider’s parse method. It encapsulates both the ScrapedItems
found on a page and any new Requests that should be scheduled for crawling.
This allows spiders to not only extract data but also to discover and
follow new links within the same processing step.
Structs§
- Parse
Output - The output of a spider’s
parsemethod.
Traits§
- Scraped
Item - A trait representing a scraped item.