PDF parsing module.
Structs§
- Column - A detected column in the page layout.
- DetectedTable - A detected table region with its content.
- FontStatistics - Font statistics for heading detection.
- LayoutAnalyzer - Layout analyzer for extracting structured text from PDF pages.
- ParseOptions - Options for parsing PDF documents.
- PdfParser - PDF document parser.
- TableDetector - Detects tables in a list of text spans.
- TableDetectorConfig - Table detector configuration.
- TableRowData - A row of text spans in a table.
- TextBlock - A text block (paragraph, heading, etc.).
- TextLine - A text line composed of multiple spans on the same baseline.
- TextSpan - A text span with position and style information.
Enums§
- BlockType - Type of text block.
- ErrorMode - Error handling mode during parsing.
- ExtractMode - What content to extract from the document.
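
The text line entry in the struct list describes a line as multiple spans on the same baseline. The sketch below illustrates one way that grouping could work. It is not this module's implementation: `Span`, `Line`, `group_into_lines`, `line_text`, and the `tolerance` parameter are hypothetical stand-ins defined here for illustration, since this page does not show the fields or methods of the real types.

```rust
// Hypothetical stand-ins for illustration only. The real span and line
// types in this module may have different fields and names.
#[derive(Debug, Clone, PartialEq)]
pub struct Span {
    pub text: String,
    pub x: f32,        // left edge, in page units
    pub baseline: f32, // y coordinate of the text baseline
}

#[derive(Debug, Clone, PartialEq)]
pub struct Line {
    pub baseline: f32, // baseline of the first span placed on this line
    pub spans: Vec<Span>,
}

/// Groups spans into lines. A span joins the current line when its baseline
/// is within `tolerance` of that line's baseline; otherwise it starts a new
/// line. Lines are returned in ascending baseline order.
pub fn group_into_lines(mut spans: Vec<Span>, tolerance: f32) -> Vec<Line> {
    // Sort by baseline so spans belonging to one line become adjacent.
    spans.sort_by(|a, b| a.baseline.total_cmp(&b.baseline));

    let mut lines: Vec<Line> = Vec::new();
    for span in spans {
        let fits = lines
            .last()
            .map_or(false, |line| (span.baseline - line.baseline).abs() <= tolerance);
        if fits {
            if let Some(line) = lines.last_mut() {
                line.spans.push(span);
            }
        } else {
            lines.push(Line { baseline: span.baseline, spans: vec![span] });
        }
    }

    // Within each line, order spans left to right.
    for line in &mut lines {
        line.spans.sort_by(|a, b| a.x.total_cmp(&b.x));
    }
    lines
}

/// Joins the text of a line's spans with single spaces.
pub fn line_text(line: &Line) -> String {
    line.spans
        .iter()
        .map(|s| s.text.as_str())
        .collect::<Vec<_>>()
        .join(" ")
}
```

Comparing each span against the baseline of the line's first span, rather than against the previous span, keeps small offsets from accumulating across a long line. A real layout analyzer would likely derive the tolerance from font size instead of taking a fixed value.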