Expand description
Multi-page PDF document processing.
This module provides functionality to process multi-page PDF documents, extracting text and structure from each page with support for page ordering, table of contents generation, and cross-page table handling.
Structsยง
- PdfDocument
Result - Result of processing a multi-page PDF document.
- PdfMetadata
- PDF document metadata.
- PdfPage
- Represents a processed page from a PDF document.
- PdfProcessing
Config - Configuration for PDF processing.
- PdfProcessor
- PDF document processor.
- Search
Result - Search result for text search in PDF.
- TocEntry
- Table of contents entry.