Crate html_to_markdown_rs

High-performance HTML to Markdown converter.

Built with html5ever for fast, memory-efficient HTML parsing.

§Optional inline image extraction

Enable the inline-images Cargo feature to collect embedded data URI images and inline SVG assets alongside the produced Markdown.
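As a rough sketch, the feature could be switched on in a dependent crate's Cargo.toml as shown here. The package name `html-to-markdown-rs` and the wildcard version are assumptions made for illustration; only the feature name `inline-images` comes from the description above.

```toml
[dependencies]
# Assumed package name and placeholder version; pin a real version in practice.
html-to-markdown-rs = { version = "*", features = ["inline-images"] }
```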

Re-exports§

pub use error::ConversionError;
pub use error::Result;
pub use metadata::DEFAULT_MAX_STRUCTURED_DATA_SIZE;
pub use metadata::DocumentMetadata;
pub use metadata::ExtendedMetadata;
pub use metadata::HeaderMetadata;
pub use metadata::ImageMetadata;
pub use metadata::ImageType;
pub use metadata::LinkMetadata;
pub use metadata::LinkType;
pub use metadata::MetadataConfig;
pub use metadata::StructuredData;
pub use metadata::StructuredDataType;
pub use metadata::TextDirection;
pub use options::CodeBlockStyle;
pub use options::ConversionOptions;
pub use options::HeadingStyle;
pub use options::HighlightStyle;
pub use options::ListIndentType;
pub use options::NewlineStyle;
pub use options::PreprocessingOptions;
pub use options::PreprocessingPreset;
pub use options::WhitespaceMode;

Modules§

converter
HTML to Markdown conversion using the astral-tl parser.
error
Error types for HTML to Markdown conversion.
hocr
hOCR 1.2 document processing.
metadata
Metadata extraction for HTML to Markdown conversion.
options
Configuration options for HTML to Markdown conversion.
safety
Helpers to keep binding entrypoints panic-safe.
text
Text processing utilities for Markdown conversion.
wrapper
Text wrapping functionality for Markdown output.

Functions§

convert
Convert HTML to Markdown.
convert_with_metadata
Convert HTML to Markdown with comprehensive metadata extraction (requires the metadata feature).