Skip to main content

Module layout

Module layout 

Source
Expand description

Spatial grouping of text blocks into lines, paragraphs, and tables.

Structs§

DocxImage
An image to embed into the resulting DOCX document.
Paragraph
A paragraph composed of one or more Runs.
Run
A run of text with consistent formatting — the smallest unit emitted to the resulting DOCX.
Table
A table reconstructed from text blocks aligned in columns.

Enums§

PageElement
One element in the per-page layout: a paragraph, a table, or an image.

Functions§

analyze_page
Analyze text blocks from a page and group them into paragraphs and tables.
map_font_name
Detect bold/italic from a PDF font name.