Crate html_to_markdown_rs

High-performance HTML to Markdown converter.

Built with html5ever for fast, memory-efficient HTML parsing.

Optional inline image extraction

Enable the inline-images Cargo feature to collect embedded data URI images and inline SVG assets alongside the produced Markdown.
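For readers unfamiliar with the term, a data URI embeds the image bytes directly in the attribute value, e.g. `data:image/png;base64,...`. The sketch below only illustrates that shape. `parse_data_uri` is a hypothetical helper written for this example and is not part of this crate's API, nor a description of how the crate extracts images.

```rust
/// Hypothetical helper (NOT part of html_to_markdown_rs): splits a data URI
/// into (media type, is_base64, payload). Returns None for any other URL.
/// Simplification: parameters such as `;charset=utf-8` are left attached to
/// the media type string rather than parsed separately.
fn parse_data_uri(uri: &str) -> Option<(&str, bool, &str)> {
    // Everything after the `data:` scheme prefix.
    let rest = uri.strip_prefix("data:")?;
    // The first comma separates the header from the payload.
    let (header, payload) = rest.split_once(',')?;
    // A trailing `;base64` marks a base64-encoded payload.
    let (media_type, is_base64) = match header.strip_suffix(";base64") {
        Some(media_type) => (media_type, true),
        None => (header, false),
    };
    Some((media_type, is_base64, payload))
}

fn main() {
    let src = "data:image/png;base64,iVBORw0KGgo=";
    if let Some((media_type, is_base64, payload)) = parse_data_uri(src) {
        println!("{media_type} base64={is_base64} payload_len={}", payload.len());
    }
}
```

Ordinary `http(s)` image URLs do not match this shape, which is why they are not inline assets in the sense used above.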

Re-exports

pub use crate::error::ConversionError;
pub use crate::error::Result;
pub use crate::metadata::DEFAULT_MAX_STRUCTURED_DATA_SIZE;
pub use crate::metadata::DocumentMetadata;
pub use crate::metadata::ExtendedMetadata;
pub use crate::metadata::HeaderMetadata;
pub use crate::metadata::ImageMetadata;
pub use crate::metadata::ImageType;
pub use crate::metadata::LinkMetadata;
pub use crate::metadata::LinkType;
pub use crate::metadata::MetadataConfig;
pub use crate::metadata::MetadataConfigUpdate;
pub use crate::metadata::StructuredData;
pub use crate::metadata::StructuredDataType;
pub use crate::metadata::TextDirection;
pub use crate::options::CodeBlockStyle;
pub use crate::options::ConversionOptions;
pub use crate::options::ConversionOptionsUpdate;
pub use crate::options::HeadingStyle;
pub use crate::options::HighlightStyle;
pub use crate::options::ListIndentType;
pub use crate::options::NewlineStyle;
pub use crate::options::OutputFormat;
pub use crate::options::PreprocessingOptions;
pub use crate::options::PreprocessingOptionsUpdate;
pub use crate::options::PreprocessingPreset;
pub use crate::options::WhitespaceMode;

Modules

converter
HTML to Markdown conversion engine with modular architecture.
error
Error types for HTML to Markdown conversion.
hocr
hOCR 1.2 document processing.
metadata
Metadata extraction for HTML to Markdown conversion.
options
Configuration options for HTML to Markdown conversion.
prelude
Prelude module for convenient imports.
safety
Helpers to keep binding entrypoints panic-safe.
text
Text processing utilities for Markdown conversion.
wrapper
Text wrapping functionality for Markdown output.

Functions

conversion_options_from_json
Parse a JSON string into ConversionOptions.
conversion_options_update_from_json
Parse a JSON string into a partial ConversionOptions update.
convert
Convert HTML to Markdown.
convert_with_metadata
Convert HTML to Markdown with comprehensive metadata extraction (requires the metadata feature).
metadata_config_from_json
Parse a JSON string into MetadataConfig (requires the metadata feature).
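
The "partial update" wording above suggests a common pattern: an update value carries only the fields the caller wants to change, and everything else keeps its current setting. The sketch below shows that pattern with stand-in types. `Options`, `OptionsUpdate`, `apply_update`, and all field names are assumptions made for illustration; they are not the definitions of ConversionOptions or ConversionOptionsUpdate, so check the options module for the real fields and methods.

```rust
// Stand-in types for illustration only. The real ConversionOptions and
// ConversionOptionsUpdate in this crate define their own fields.
#[derive(Debug, Clone, PartialEq)]
struct Options {
    wrap: bool,
    wrap_width: usize,
    bullets: String,
}

impl Default for Options {
    fn default() -> Self {
        Self { wrap: false, wrap_width: 80, bullets: "-".to_string() }
    }
}

/// Partial update: `None` means "leave the current value alone".
#[derive(Debug, Default)]
struct OptionsUpdate {
    wrap: Option<bool>,
    wrap_width: Option<usize>,
    bullets: Option<String>,
}

impl Options {
    /// Overwrite only the fields that the update actually sets.
    fn apply_update(&mut self, update: OptionsUpdate) {
        if let Some(wrap) = update.wrap {
            self.wrap = wrap;
        }
        if let Some(wrap_width) = update.wrap_width {
            self.wrap_width = wrap_width;
        }
        if let Some(bullets) = update.bullets {
            self.bullets = bullets;
        }
    }
}

fn main() {
    let mut options = Options::default();
    // Change a single field; the other two keep their defaults.
    options.apply_update(OptionsUpdate { wrap: Some(true), ..Default::default() });
    println!("{options:?}");
}
```

A design like this lets a caller, for example a language binding that passes JSON, send only the overrides instead of a complete options object.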