Module metadata

Module metadata 

Source
Expand description

Metadata extraction for halldyll-parser

This module handles extraction of:

  • Title and meta tags
  • OpenGraph metadata
  • Twitter Card metadata
  • Robots directives
  • Canonical URLs
  • Hreflang alternates
  • Structured data (JSON-LD, Microdata)
  • Favicon and icons

Functionsยง

extract_alternates
Extract alternate language links (hreflang)
extract_apple_touch_icon
Extract Apple touch icon
extract_base_url
Extract base URL from tag
extract_canonical
Extract canonical URL
extract_charset
Extract charset from meta tag
extract_favicon
Extract favicon URL
extract_json_ld
Extract JSON-LD structured data
extract_keywords
Extract keywords as a list
extract_language
Extract language from html lang attribute
extract_meta_content
Extract content from a meta tag by name
extract_metadata
Extract all metadata from an HTML document
extract_microdata
Extract Microdata structured data
extract_opengraph
Extract OpenGraph metadata
extract_robots
Extract robots meta directives
extract_structured_data
Extract all structured data from the document
extract_title
Extract page title
extract_twitter_card
Extract Twitter Card metadata