Skip to main content

Module item

Module item 

Source
Expand description

Data structures for scraped items and spider output in spider-lib.

This module defines the ScrapedItem trait, which is the core abstraction for any data extracted by a web spider. Implementors of this trait define the shape of the data they wish to collect.

Additionally, the ParseOutput struct is provided as the standard return type for a spider’s parse method. It encapsulates both the ScrapedItems found on a page and any new Requests that should be scheduled for crawling. This allows spiders to not only extract data but also to discover and follow new links within the same processing step.

Structs§

ParseOutput
The output of a spider’s parse method.

Traits§

ScrapedItem
A trait representing a scraped item.