Module text_extractor

Text extraction from PDF content streams.

Walks content stream operations and produces TextChunks with position, font, and Unicode text information.
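The walk described above can be pictured with a minimal, self-contained sketch. Everything in it is an assumption made for illustration: the `Op` enum, the fields of this `TextChunk`, and the `walk_ops` function are hypothetical and are not this module's actual types or signatures. The sketch also assumes text is already decoded to Unicode, and it ignores glyph advances, the text matrix, and the page transformation matrix.

```rust
// Hypothetical, simplified model of a few PDF text operators.
// These types are illustrative only, not the crate's real API.
#[derive(Debug, Clone, PartialEq)]
enum Op {
    BeginText,            // BT: start a text object, reset the text position
    EndText,              // ET: end the text object
    SetFont(String, f32), // Tf: font resource name and size
    MoveText(f32, f32),   // Td: move to the next line by (dx, dy)
    ShowText(String),     // Tj: show a string (assumed already decoded to Unicode)
}

// Hypothetical chunk record carrying position, font, and text.
#[derive(Debug, Clone, PartialEq)]
struct TextChunk {
    x: f32,
    y: f32,
    font: String,
    font_size: f32,
    text: String,
}

// Walk the operations in order, tracking the current font and position,
// and emit one chunk per ShowText operation.
fn walk_ops(ops: &[Op]) -> Vec<TextChunk> {
    let mut chunks = Vec::new();
    let (mut x, mut y) = (0.0_f32, 0.0_f32);
    let mut font = String::new();
    let mut font_size = 0.0_f32;

    for op in ops {
        match op {
            Op::BeginText => {
                // A new text object starts at the origin.
                x = 0.0;
                y = 0.0;
            }
            Op::EndText => {}
            Op::SetFont(name, size) => {
                font = name.clone();
                font_size = *size;
            }
            Op::MoveText(dx, dy) => {
                x += *dx;
                y += *dy;
            }
            Op::ShowText(text) => chunks.push(TextChunk {
                x,
                y,
                font: font.clone(),
                font_size,
                text: text.clone(),
            }),
        }
    }
    chunks
}
```

Given the sequence `BeginText`, `SetFont("F1", 12.0)`, `MoveText(72.0, 700.0)`, `ShowText("Hello")`, this sketch yields a single chunk at (72, 700) in font `F1` at size 12. A real extractor would additionally map character codes through the font's encoding and advance the position by glyph widths after each shown string.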

Functions

extract_text_chunks
Extract text chunks from a single page.