Expand description
High-performance HTML to Markdown converter.
Built with html5ever for fast, memory-efficient HTML parsing.
§Optional inline image extraction
Enable the inline-images Cargo feature to collect embedded data URI images and inline SVG
assets alongside the produced Markdown.
Re-exports§
pub use types::AnnotationKind;pub use types::ConversionResult;pub use types::DocumentNode;pub use types::DocumentStructure;pub use types::GridCell;pub use types::NodeContent;pub use types::ProcessingWarning;pub use types::TableData;pub use types::TableGrid;pub use types::TextAnnotation;pub use types::WarningKind;pub use crate::error::ConversionError;pub use crate::error::Result;pub use crate::metadata::DEFAULT_MAX_STRUCTURED_DATA_SIZE;pub use crate::metadata::DocumentMetadata;pub use crate::metadata::HeaderMetadata;pub use crate::metadata::HtmlMetadata;pub use crate::metadata::ImageMetadata;pub use crate::metadata::ImageType;pub use crate::metadata::LinkMetadata;pub use crate::metadata::LinkType;pub use crate::metadata::MetadataConfig;pub use crate::metadata::MetadataConfigUpdate;pub use crate::metadata::StructuredData;pub use crate::metadata::StructuredDataType;pub use crate::metadata::TextDirection;pub use crate::options::CodeBlockStyle;pub use crate::options::ConversionOptions;pub use crate::options::ConversionOptionsBuilder;pub use crate::options::ConversionOptionsUpdate;pub use crate::options::HeadingStyle;pub use crate::options::HighlightStyle;pub use crate::options::LinkStyle;pub use crate::options::ListIndentType;pub use crate::options::NewlineStyle;pub use crate::options::OutputFormat;pub use crate::options::PreprocessingOptions;pub use crate::options::PreprocessingOptionsUpdate;pub use crate::options::PreprocessingPreset;pub use crate::options::WhitespaceMode;
Modules§
- error
- Error types for HTML to Markdown conversion.
- metadata
- Metadata extraction for HTML to Markdown conversion.
- options
- Configuration options for HTML to Markdown conversion.
- types
- Core types for structured HTML extraction results.
Functions§
- conversion_
options_ from_ json - Parse JSON string into
ConversionOptions. - conversion_
options_ update_ from_ json - Parse JSON string into partial
ConversionOptionsupdate. - convert
- Convert HTML to Markdown, returning a
ConversionResultwith content, metadata, images, and warnings. - metadata_
config_ from_ json - Parse JSON string into
MetadataConfig(requiresmetadatafeature).