Skip to main content

Module char_extraction

Module char_extraction 

Source
Expand description

Character bounding box calculation from content stream events.

Combines font metrics, text state, and CTM to calculate the final bounding box for each character in top-left origin coordinates. This bridges Layer 2 (interpreter) and Layer 3 (object extraction).

Functionsยง

char_from_event
Convert a CharEvent and font metrics into a fully-populated Char with bounding box in top-left origin page coordinates.