Metadata extraction for halldyll-parser
This module handles extraction of:
- Title and meta tags
- OpenGraph metadata
- Twitter Card metadata
- Robots directives
- Canonical URLs
- Hreflang alternates
- Structured data (JSON-LD, Microdata)
- Favicon and icons
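To make the robots-directive item above more concrete, here is a minimal sketch of how the `content` value of a robots meta tag could be interpreted. It is not halldyll-parser's implementation: the `RobotsDirectives` struct and the `parse_robots_content` function are hypothetical names introduced only for this illustration, and the set of recognised directives is an assumption.

```rust
/// Hypothetical container for parsed robots directives.
/// (Illustration only; not a type taken from halldyll-parser.)
#[derive(Debug, PartialEq, Eq)]
pub struct RobotsDirectives {
    pub index: bool,
    pub follow: bool,
    pub noarchive: bool,
    pub nosnippet: bool,
}

/// Parse the `content` attribute of a `<meta name="robots">` tag.
/// Hypothetical helper; the real crate may behave differently.
pub fn parse_robots_content(content: &str) -> RobotsDirectives {
    // Assumed defaults: indexing and link following are allowed
    // unless a directive says otherwise.
    let mut directives = RobotsDirectives {
        index: true,
        follow: true,
        noarchive: false,
        nosnippet: false,
    };

    // Directives are comma-separated and matched case-insensitively.
    for token in content.split(',') {
        match token.trim().to_ascii_lowercase().as_str() {
            "noindex" => directives.index = false,
            "nofollow" => directives.follow = false,
            "none" => {
                directives.index = false;
                directives.follow = false;
            }
            "index" => directives.index = true,
            "follow" => directives.follow = true,
            "all" => {
                directives.index = true;
                directives.follow = true;
            }
            "noarchive" => directives.noarchive = true,
            "nosnippet" => directives.nosnippet = true,
            // Unknown or parameterised directives are ignored in this sketch.
            _ => {}
        }
    }

    directives
}
```

Under these assumptions, `parse_robots_content("noindex, follow")` yields a value with `index` set to `false` and `follow` left at `true`.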
Functions
- extract_alternates - Extract alternate language links (hreflang)
- extract_apple_touch_icon - Extract Apple touch icon
- extract_base_url - Extract base URL from <base> tag
- extract_canonical - Extract canonical URL
- extract_charset - Extract charset from meta tag
- extract_favicon - Extract favicon URL
- extract_json_ld - Extract JSON-LD structured data
- extract_keywords - Extract keywords as a list
- extract_language - Extract language from html lang attribute
- extract_meta_content - Extract content from a meta tag by name
- extract_metadata - Extract all metadata from an HTML document
- extract_microdata - Extract Microdata structured data
- extract_opengraph - Extract OpenGraph metadata
- extract_robots - Extract robots meta directives
- extract_structured_data - Extract all structured data from the document
- extract_title - Extract page title
- extract_twitter_card - Extract Twitter Card metadata
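
The signatures of these functions are not shown on this page, so the following is only a sketch of the kind of post-processing that "extract keywords as a list" implies: splitting the `content` value of a keywords meta tag on commas. The `split_keywords` helper is a hypothetical name for this illustration and is not part of halldyll-parser.

```rust
/// Split a comma-separated keywords string into a clean list.
/// Hypothetical helper for illustration; not the crate's `extract_keywords`.
pub fn split_keywords(content: &str) -> Vec<String> {
    content
        .split(',')
        // Remove surrounding whitespace from each entry.
        .map(str::trim)
        // Drop empty entries left by stray or trailing commas.
        .filter(|keyword| !keyword.is_empty())
        .map(str::to_string)
        .collect()
}
```

For example, `split_keywords("rust, html , ,parser,")` keeps only the three non-empty entries `rust`, `html` and `parser`.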