Parse - Content extraction (text, links, images, videos, metadata)
Re-exports
pub use router::ContentRouter;
pub use router::ContentType as ParsedContentType;
pub use html::HtmlParser;
pub use text::TextExtractor;
pub use links::LinkExtractor;
pub use images::ImageExtractor;
pub use videos::VideoExtractor;
pub use audios::AudioExtractor;
pub use metadata::MetadataExtractor;
pub use jsonld::JsonLdExtractor;
pub use opengraph::OpenGraphExtractor;
Modules
- audios: Audio extraction
- html: Robust HTML parsing
- images: Image extraction
- jsonld: JSON-LD structured data extraction
- links: Link extraction
- metadata: Metadata extraction
- opengraph: OpenGraph metadata extraction
- router: Routing by Content-Type
- text: Main text extraction (boilerplate removal)
- videos: Video extraction
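
The `router` module is described as routing by Content-Type. The sketch below illustrates that general idea only: it classifies a raw `Content-Type` header value into a coarse category that a caller could use to pick an extractor. The names `SketchContentType` and `route_by_content_type`, and the set of categories, are hypothetical and are not taken from this crate; the real `ContentRouter` and `ContentType` may be shaped quite differently.

```rust
// Hypothetical sketch: none of these names come from the crate's `router` module.

/// Coarse content categories a router might dispatch on (assumed set).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum SketchContentType {
    Html,
    Json,
    PlainText,
    Image,
    Video,
    Audio,
    Unknown,
}

/// Classify a raw `Content-Type` header value, e.g. "text/html; charset=utf-8".
pub fn route_by_content_type(header: &str) -> SketchContentType {
    // Drop parameters such as `charset`, then normalize case and whitespace.
    let essence = header
        .split(';')
        .next()
        .unwrap_or("")
        .trim()
        .to_ascii_lowercase();

    match essence.as_str() {
        "text/html" | "application/xhtml+xml" => SketchContentType::Html,
        "application/json" | "application/ld+json" => SketchContentType::Json,
        "text/plain" => SketchContentType::PlainText,
        other if other.starts_with("image/") => SketchContentType::Image,
        other if other.starts_with("video/") => SketchContentType::Video,
        other if other.starts_with("audio/") => SketchContentType::Audio,
        _ => SketchContentType::Unknown,
    }
}
```

In this sketch a caller would match on the returned category to decide which extractor to run, and fall back to a default path for `Unknown`. Matching on the media type alone, with parameters stripped, keeps the dispatch independent of charset or boundary details.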