spider-pipeline
Item pipelines for processing, filtering, and exporting scraped data in spider-lib.
Use this crate directly when you want pipeline features without bringing the full facade crate.
Installation
[]
= "0.3.4"
Built-in Pipelines
Core (always available):
ConsolePipelineDeduplicationPipeline
Optional (feature-gated):
pipeline-json->JsonPipelinepipeline-jsonl->JsonlPipelinepipeline-csv->CsvPipelinepipeline-sqlite->SqlitePipelinepipeline-stream-json->StreamJsonPipeline
Usage
use ;
let crawler = new
.add_pipeline
.add_pipeline
.build
.await?;
Feature Flags
core(default)pipeline-csvpipeline-jsonpipeline-jsonlpipeline-sqlitepipeline-stream-json
[]
= { = "0.3.4", = ["pipeline-jsonl", "pipeline-csv"] }
When using via spider-lib, enable root features with the same names.
Related Crates
License
MIT. See LICENSE.