Skip to main content

Crate fast_h2m

Crate fast_h2m 

Source
Expand description

High-performance HTML to Markdown converter.

Built with html5ever for fast, memory-efficient HTML parsing.

§Optional inline image extraction

Enable the inline-images Cargo feature to collect embedded data URI images and inline SVG assets alongside the produced Markdown.

Re-exports§

pub use types::AnnotationKind;
pub use types::ConversionResult;
pub use types::DocumentNode;
pub use types::DocumentStructure;
pub use types::GridCell;
pub use types::NodeContent;
pub use types::ProcessingWarning;
pub use types::TableData;
pub use types::TableGrid;
pub use types::TextAnnotation;
pub use types::WarningKind;
pub use crate::error::ConversionError;
pub use crate::error::Result;
pub use crate::metadata::DEFAULT_MAX_STRUCTURED_DATA_SIZE;
pub use crate::metadata::DocumentMetadata;
pub use crate::metadata::HeaderMetadata;
pub use crate::metadata::HtmlMetadata;
pub use crate::metadata::ImageMetadata;
pub use crate::metadata::ImageType;
pub use crate::metadata::LinkMetadata;
pub use crate::metadata::LinkType;
pub use crate::metadata::MetadataConfig;
pub use crate::metadata::MetadataConfigUpdate;
pub use crate::metadata::StructuredData;
pub use crate::metadata::StructuredDataType;
pub use crate::metadata::TextDirection;
pub use crate::options::CodeBlockStyle;
pub use crate::options::ConversionOptions;
pub use crate::options::ConversionOptionsBuilder;
pub use crate::options::ConversionOptionsUpdate;
pub use crate::options::HeadingStyle;
pub use crate::options::HighlightStyle;
pub use crate::options::LinkStyle;
pub use crate::options::ListIndentType;
pub use crate::options::NewlineStyle;
pub use crate::options::OutputFormat;
pub use crate::options::PreprocessingOptions;
pub use crate::options::PreprocessingOptionsUpdate;
pub use crate::options::PreprocessingPreset;
pub use crate::options::TierStrategy;
pub use crate::options::WhitespaceMode;

Modules§

error
Error types for HTML to Markdown conversion.
metadata
Metadata extraction for HTML to Markdown conversion.
options
Configuration options for HTML to Markdown conversion.
types
Core types for structured HTML extraction results.

Structs§

MarkdownStreamProcessor
Streaming HTML-to-Markdown converter backed by mdream.

Functions§

convert
Convert HTML to Markdown, returning a ConversionResult with content, metadata, images, and warnings.