Functionsยง
- content_
root - Pick the element most likely to contain the primary article content.
- extract_
metadata - Extract citation-oriented metadata: description, author, publish date,
language, and site name (from standard
<meta>/OpenGraph tags). - extract_
title - Extract the page title from
<title>or the first<h1>.