Skip to main content

Crate html_to_markdown_rs

Crate html_to_markdown_rs 

Source
Expand description

High-performance HTML to Markdown converter.

Built with html5ever for fast, memory-efficient HTML parsing.

§Optional inline image extraction

Enable the inline-images Cargo feature to collect embedded data URI images and inline SVG assets alongside the produced Markdown.

Re-exports§

pub use types::AnnotationKind;
pub use types::ConversionResult;
pub use types::DocumentNode;
pub use types::DocumentStructure;
pub use types::GridCell;
pub use types::NodeContent;
pub use types::ProcessingWarning;
pub use types::TableData;
pub use types::TableGrid;
pub use types::TextAnnotation;
pub use types::WarningKind;
pub use crate::error::ConversionError;
pub use crate::error::Result;
pub use crate::metadata::DEFAULT_MAX_STRUCTURED_DATA_SIZE;
pub use crate::metadata::DocumentMetadata;
pub use crate::metadata::HeaderMetadata;
pub use crate::metadata::HtmlMetadata;
pub use crate::metadata::ImageMetadata;
pub use crate::metadata::ImageType;
pub use crate::metadata::LinkMetadata;
pub use crate::metadata::LinkType;
pub use crate::metadata::MetadataConfig;
pub use crate::metadata::MetadataConfigUpdate;
pub use crate::metadata::StructuredData;
pub use crate::metadata::StructuredDataType;
pub use crate::metadata::TextDirection;
pub use crate::options::CodeBlockStyle;
pub use crate::options::ConversionOptions;
pub use crate::options::ConversionOptionsBuilder;
pub use crate::options::ConversionOptionsUpdate;
pub use crate::options::HeadingStyle;
pub use crate::options::HighlightStyle;
pub use crate::options::LinkStyle;
pub use crate::options::ListIndentType;
pub use crate::options::NewlineStyle;
pub use crate::options::OutputFormat;
pub use crate::options::PreprocessingOptions;
pub use crate::options::PreprocessingOptionsUpdate;
pub use crate::options::PreprocessingPreset;
pub use crate::options::WhitespaceMode;

Modules§

error
Error types for HTML to Markdown conversion.
metadata
Metadata extraction for HTML to Markdown conversion.
options
Configuration options for HTML to Markdown conversion.
types
Core types for structured HTML extraction results.

Functions§

conversion_options_from_json
Parse JSON string into ConversionOptions.
conversion_options_update_from_json
Parse JSON string into partial ConversionOptions update.
convert
Convert HTML to Markdown, returning a ConversionResult with content, metadata, images, and warnings.
metadata_config_from_json
Parse JSON string into MetadataConfig (requires metadata feature).