Expand description
Chunk types — atomic units of extracted content.
Structs§
- Image
Chunk - Image bounding box — actual pixel data extracted at output time.
- Line
ArtChunk - Vector graphic — collection of line segments forming bullets, decorations, etc.
- Line
Chunk - Line segment — used for table border detection.
- Text
Chunk - Atomic text fragment — one font run in the PDF content stream.
Constants§
- LINE_
ART_ SIZE_ EPSILON - Size comparison tolerance for line art classification.