Type definitions for halldyll-parser
This module contains all public types used throughout the parser:
- Error types
- Content types (text, headings, lists, tables, etc.)
- Metadata types (OpenGraph, Twitter Cards, etc.)
- Structured data types (JSON-LD, Microdata, RDFa)
Structs
- AlternateLink - Alternate language version (hreflang)
- CodeBlock - An extracted code block
- Heading - A heading element (h1-h6)
- Image - An extracted image
- Link - An extracted link
- ListContent - An extracted list
- ListItem - A list item (may contain nested content)
- OpenGraph - OpenGraph metadata
- PageMetadata - Complete page metadata
- ParseStats - Parsing statistics
- ParsedContent - Complete parsed content from an HTML document
- ParserConfig - Configuration for the HTML parser
- Quote - An extracted blockquote
- RobotsMeta - Robots meta directives
- StructuredData - Extracted structured data item
- TableCell - A table cell
- TableContent - An extracted table
- TableRow - A table row
- TextContent - Extracted text content with metadata
- TwitterCard - Twitter Card metadata
Enums
- ImageLoading - Image loading strategy
- LinkRel - Relationship types for links
- LinkType - Type of link (internal vs external)
- ListType - Type of list
- ParserError - Errors that can occur during HTML parsing
- StructuredDataFormat - Type of structured data
Functions
- clean_text - Clean text by removing control characters
- normalize_whitespace - Normalize whitespace in text
- truncate_text - Truncate text to max length with ellipsis
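The three text helpers above are listed by name and one-line description only. Their signatures are not shown here. As a rough illustration of what such helpers typically do, the following is a minimal sketch with assumed signatures and assumed behavior (character-based length, a three-dot ASCII ellipsis, newlines and tabs preserved by the cleaning step). It is not the crate's actual implementation.

```rust
/// Hypothetical sketch: drop control characters but keep newlines and tabs.
/// (Which characters the real function preserves is an assumption.)
fn clean_text(input: &str) -> String {
    input
        .chars()
        .filter(|&c| !c.is_control() || c == '\n' || c == '\t')
        .collect()
}

/// Hypothetical sketch: collapse every run of whitespace into a single
/// space and trim both ends.
fn normalize_whitespace(input: &str) -> String {
    input.split_whitespace().collect::<Vec<_>>().join(" ")
}

/// Hypothetical sketch: limit the text to `max_chars` characters,
/// ending with "..." when something was cut off.
fn truncate_text(input: &str, max_chars: usize) -> String {
    const ELLIPSIS: &str = "...";

    // Text that already fits is returned unchanged.
    if input.chars().count() <= max_chars {
        return input.to_string();
    }
    // Too little room for an ellipsis: hard-cut without one.
    if max_chars <= ELLIPSIS.len() {
        return input.chars().take(max_chars).collect();
    }
    // Reserve room for the ellipsis; counting chars (not bytes)
    // avoids splitting a multi-byte UTF-8 character.
    let keep = max_chars - ELLIPSIS.len();
    let mut out: String = input.chars().take(keep).collect();
    out.push_str(ELLIPSIS);
    out
}
```

Counting characters rather than bytes is a deliberate choice in this sketch: slicing a `&str` at an arbitrary byte offset can panic on multi-byte input, whereas `chars().take(n)` cannot.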
Type Aliases
- ParserResult - Result type for parser operations
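A result alias paired with a crate-level error enum is a common Rust pattern, and the sketch below shows how `ParserResult` and `ParserError` might fit together. The alias definition, the error variants, and the `require_non_empty` helper are all assumptions made for illustration. The real definitions in halldyll-parser may differ.

```rust
use std::fmt;

/// Hypothetical error enum; the variants are illustrative only.
#[derive(Debug, PartialEq)]
enum ParserError {
    EmptyInput,
    InvalidEncoding(String),
}

impl fmt::Display for ParserError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            ParserError::EmptyInput => write!(f, "input is empty"),
            ParserError::InvalidEncoding(name) => {
                write!(f, "invalid encoding: {}", name)
            }
        }
    }
}

impl std::error::Error for ParserError {}

/// Assumed shape of the alias: a `Result` with the error type fixed.
type ParserResult<T> = Result<T, ParserError>;

/// Hypothetical helper showing the alias in a function signature.
fn require_non_empty(html: &str) -> ParserResult<&str> {
    let trimmed = html.trim();
    if trimmed.is_empty() {
        Err(ParserError::EmptyInput)
    } else {
        Ok(trimmed)
    }
}
```

With an alias like this, every fallible function in the crate can return `ParserResult<T>` and callers can propagate failures with `?` without repeating the error type.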