Module parse

Module parse 

Source
Expand description

Parse - Content extraction (text, links, images, videos, metadata)

Re-exports§

pub use router::ContentRouter;
pub use router::ContentType as ParsedContentType;
pub use html::HtmlParser;
pub use text::TextExtractor;
pub use links::LinkExtractor;
pub use images::ImageExtractor;
pub use videos::VideoExtractor;
pub use audios::AudioExtractor;
pub use metadata::MetadataExtractor;
pub use jsonld::JsonLdExtractor;
pub use opengraph::OpenGraphExtractor;

Modules§

audios
Audios - Audio extraction
html
HTML - Robust HTML parsing
images
Images - Image extraction
jsonld
JSON-LD - JSON-LD structured data extraction
links
Links - Link extraction
metadata
Metadata - Metadata extraction
opengraph
OpenGraph - OpenGraph metadata extraction
router
Router - Routing by Content-Type
text
Text - Main text extraction (boilerplate removal)
videos
Videos - Video extraction