Skip to main content

Module models

Module models 

Source
Expand description

Data model types for EdgeParse.

Implements the type hierarchy from Spec 05.

Modules§

bbox
BoundingBox — Core geometry type for all positioned PDF elements.
chunks
Chunk types — atomic units of extracted content.
content
Unified ContentElement enum — all page content.
document
PdfDocument — top-level extracted document.
enums
Enumerations for EdgeParse data models.
list
List structures — PDFList, ListItem.
semantic
Semantic node types — SemanticParagraph, SemanticHeading, etc.
table
Table structures — TableBorder, TableBorderRow, TableBorderCell.
text
Text grouping types — TextLine, TextBlock, TextColumn.