Module pdf_processing

Module pdf_processing 

Source
Expand description

Multi-page PDF document processing.

This module provides functionality to process multi-page PDF documents, extracting text and structure from each page with support for page ordering, table of contents generation, and cross-page table handling.

Structsยง

PdfDocumentResult
Result of processing a multi-page PDF document.
PdfMetadata
PDF document metadata.
PdfPage
Represents a processed page from a PDF document.
PdfProcessingConfig
Configuration for PDF processing.
PdfProcessor
PDF document processor.
SearchResult
Search result for text search in PDF.
TocEntry
Table of contents entry.