Expand description
HTML to UCM document translator.
This crate provides translation from HTML documents to UCM’s block-based document model. It extracts semantic structure from HTML elements and creates appropriate blocks with proper hierarchy.
§Example
use ucp_translator_html::{HtmlParser, HtmlParserConfig};
let html = r#"<html><body>
<h1>Title</h1>
<p>Some content here.</p>
</body></html>"#;
let parser = HtmlParser::new();
let doc = parser.parse(html).unwrap();Structs§
- Html
Parser - HTML to UCM document parser
- Html
Parser Config - Configuration for HTML parsing
Enums§
- Heading
Strategy - Strategy for handling heading levels
- Html
Error - HTML translation error
Functions§
- parse_
html - Parse HTML string into a UCM Document.
Type Aliases§
- Result
- Result type for HTML translation