Module html

HTML processing module

This module provides functions for fetching, parsing, and processing HTML content.

Functions

convert_relative_urls
Convert relative URLs to absolute URLs in HTML content.
convert_to_utf8
Convert HTML content to UTF-8 encoding.
decode_html_entities
Decode HTML entities to Unicode characters.
fetch_html
Fetch HTML content from a URL.
has_javascript
Check if HTML content contains JavaScript.
is_html
Check if content is valid HTML.
normalize_url
Normalize a URL to ensure it is absolute.
pretty_print_html
Pretty-print HTML with indentation.
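
Example (illustrative sketch)

The listing above gives only one-line summaries, so the exact signatures and behavior of these functions are not shown here. As a rough illustration of the kind of work `decode_html_entities` and `has_javascript` describe, the following standalone sketch uses only the standard library. The names `decode_entities_sketch`, `decode_numeric`, and `has_script_sketch` are hypothetical stand-ins, not this module's API, and the real functions may accept different arguments, support more entities, or detect scripts differently.

```rust
// Hypothetical stand-ins for illustration only; not this module's implementation.

/// Decode the five predefined XML/HTML entities plus numeric references
/// such as `&#233;` or `&#x21;`. Unrecognized sequences are left unchanged.
fn decode_entities_sketch(input: &str) -> String {
    let mut out = String::with_capacity(input.len());
    let mut rest = input;
    while let Some(amp) = rest.find('&') {
        // Copy everything before the ampersand verbatim.
        out.push_str(&rest[..amp]);
        let tail = &rest[amp..];
        // Try to interpret the text between '&' and the next ';' as an entity.
        let decoded = tail.find(';').and_then(|semi| {
            let name = &tail[1..semi];
            let ch = match name {
                "amp" => Some('&'),
                "lt" => Some('<'),
                "gt" => Some('>'),
                "quot" => Some('"'),
                "apos" => Some('\''),
                _ => decode_numeric(name),
            };
            ch.map(|c| (c, semi + 1))
        });
        match decoded {
            Some((c, consumed)) => {
                out.push(c);
                rest = &tail[consumed..];
            }
            None => {
                // Not an entity we know: keep the '&' and continue after it.
                out.push('&');
                rest = &tail[1..];
            }
        }
    }
    out.push_str(rest);
    out
}

/// Parse `#NNN` (decimal) or `#xHHH` (hexadecimal) into a character.
fn decode_numeric(name: &str) -> Option<char> {
    let digits = name.strip_prefix('#')?;
    let code = match digits.strip_prefix('x').or_else(|| digits.strip_prefix('X')) {
        Some(hex) => u32::from_str_radix(hex, 16).ok()?,
        None => digits.parse::<u32>().ok()?,
    };
    char::from_u32(code)
}

/// Simplified check (an assumption about what "contains JavaScript" means):
/// looks for a `<script` tag or a `javascript:` URL, ignoring ASCII case.
fn has_script_sketch(html: &str) -> bool {
    let lower = html.to_ascii_lowercase();
    lower.contains("<script") || lower.contains("javascript:")
}
```

With this sketch, `decode_entities_sketch("a &amp; b")` yields `a & b`, and `has_script_sketch("<p>hello</p>")` returns `false`. A substring check like this cannot see inline event handlers such as `onclick`, which is one reason a production implementation would more likely work from a parsed document.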