Skip to main content

Module line_extractor

Module line_extractor 

Source
Expand description

PDF line segment extraction — extract stroked/filled paths as LineChunks.

Parses content stream path operators (m, l, c, re, S, s, f, h) and classifies resulting paths as horizontal lines, vertical lines, or shapes.

Functions§

extract_line_chunks
Extract line segments and line art from a PDF page’s content stream.