Expand description
Layout detection via the RT-DETR (docling-layout-heron) model exported to
ONNX, run with ort. A port of docling-ibm-models’ LayoutPredictor:
resize the page image to 640×640 and rescale to [0,1] (the heron processor
has do_normalize=false), run the model, then RT-DETR
post_process_object_detection (sigmoid → top-k over query×class →
center-to-corners boxes scaled to the page).
Structs§
- Layout
Model - Region
- One detected region, in page points (top-left origin).
Constants§
- LABELS
- The 17 canonical layout classes, indexed by the model’s class id
(
config.jsonid2label).