Skip to main content

Module metadata

Module metadata 

Source
Expand description

Article metadata extraction module (R1).

Extracts metadata from web pages including:

  • Author information (name, URL, karma)
  • Publication date and modification date
  • Reading time and difficulty
  • Views, votes, bookmarks, comments
  • Hubs and tags (with URLs)
  • Translation information
  • LD+JSON structured data

Based on reference implementation from: https://github.com/link-foundation/meta-theory/blob/main/scripts/download-article.mjs

Structs§

ArticleMetadata
Extracted article metadata.
NamedLink
Link with name and URL.

Functions§

extract_metadata
Extract article metadata from HTML.
format_footer_block
Format footer metadata block.
format_metadata_block
Format metadata as a markdown header block.