Expand description
High-performance HTML to Markdown converter.
Built with html5ever for fast, memory-efficient HTML parsing.
§Optional inline image extraction
Enable the inline-images Cargo feature to collect embedded data URI images and inline SVG
assets alongside the produced Markdown.
Re-exports§
pub use crate::error::ConversionError;pub use crate::error::Result;pub use crate::metadata::DEFAULT_MAX_STRUCTURED_DATA_SIZE;pub use crate::metadata::DocumentMetadata;pub use crate::metadata::ExtendedMetadata;pub use crate::metadata::HeaderMetadata;pub use crate::metadata::ImageMetadata;pub use crate::metadata::ImageType;pub use crate::metadata::LinkMetadata;pub use crate::metadata::LinkType;pub use crate::metadata::MetadataConfig;pub use crate::metadata::MetadataConfigUpdate;pub use crate::metadata::StructuredData;pub use crate::metadata::StructuredDataType;pub use crate::metadata::TextDirection;pub use crate::options::CodeBlockStyle;pub use crate::options::ConversionOptions;pub use crate::options::ConversionOptionsUpdate;pub use crate::options::HeadingStyle;pub use crate::options::HighlightStyle;pub use crate::options::ListIndentType;pub use crate::options::NewlineStyle;pub use crate::options::OutputFormat;pub use crate::options::PreprocessingOptions;pub use crate::options::PreprocessingOptionsUpdate;pub use crate::options::PreprocessingPreset;pub use crate::options::WhitespaceMode;
Modules§
- converter
- HTML to Markdown conversion engine with modular architecture.
- error
- Error types for HTML to Markdown conversion.
- hocr
- hOCR 1.2 document processing.
- metadata
- Metadata extraction for HTML to Markdown conversion.
- options
- Configuration options for HTML to Markdown conversion.
- prelude
- Prelude module for convenient imports.
- safety
- Helpers to keep binding entrypoints panic-safe.
- text
- Text processing utilities for Markdown conversion.
- wrapper
- Text wrapping functionality for Markdown output.
Functions§
- conversion_
options_ from_ json - Parse JSON string into
ConversionOptions. - conversion_
options_ update_ from_ json - Parse JSON string into partial
ConversionOptionsupdate. - convert
- Convert HTML to Markdown.
- convert_
with_ metadata - Convert HTML to Markdown with comprehensive metadata extraction (requires the
metadatafeature). - metadata_
config_ from_ json - Parse JSON string into
MetadataConfig(requiresmetadatafeature).