Skip to main content

Module html

Module html 

Source
Expand description

HTML processing utilities

Provides HTML cleaning and conversion functions for documentation extraction. Uses the scraper crate for robust HTML5 parsing.

Functions§

clean_html
Clean HTML by removing unwanted tags and their content
extract_documentation
Extract documentation from HTML by cleaning and converting to Markdown
extract_documentation_as_text
Extract documentation from HTML as plain text.
extract_documentation_html
Extract documentation from HTML as cleaned HTML.
extract_search_results
Extract search results from HTML
html_to_text
Convert HTML to plain text by removing all HTML tags
is_item_fallback_page
Determine whether a resolved rustdoc page is a fallback rather than the dedicated page for item_path.
page_h1_text
Extract the collapsed text of the page’s primary <h1> heading.