Expand description
§spider-pipeline
Item pipelines for cleanup, validation, deduplication, and output.
Pipelines run after parsing. This crate contains both in-memory stages such as transforms and validators, and output backends such as CSV, JSON, SQLite, and streaming JSON.
§Example
ⓘ
use spider_pipeline::json::JsonPipeline;
use spider_pipeline::console::ConsolePipeline;
let crawler = CrawlerBuilder::new(MySpider)
.add_pipeline(JsonPipeline::new("output.json")?)
.add_pipeline(ConsolePipeline::new())
.build()
.await?;