Module document_parser

Module document_parser 

Source
Expand description

§Document Parser

Comprehensive document parsing with multi-modal content extraction.

Structs§

ChartReference
Chart reference in document
DocumentParseResult
Document parsing result
DocumentParser
Document parser for multi-modal content
DocumentParserConfig
Document parser configuration
ExtractedContent
Extracted content from document
HTMLTextExtractor
HTML text extractor
HeadingDetector
Heading detection component
HeadingPattern
Heading pattern
ImageReference
Image reference in document
LayoutDetector
Layout detection component
PDFTextExtractor
PDF text extractor
ParseStatistics
Parsing statistics
PowerPointTextExtractor
PowerPoint text extractor
ReadingOrderAnalyzer
Reading order analyzer
SectionAnalyzer
Section analysis component
SectionPattern
Section detection pattern
TextExtractionConfig
Text extraction configuration
TextExtractor
Text extraction component
WordTextExtractor
Word document text extractor

Enums§

ReadingOrderStrategy
Reading order strategies