Spatial grouping of text blocks into lines, paragraphs, and tables.
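As a rough illustration of the first grouping step, the sketch below clusters positioned text blocks into lines by vertical position. The `TextBlock` type, its fields, the `group_into_lines` helper, and the tolerance parameter are all hypothetical and chosen for this example; the module's actual types and algorithm may differ.

```rust
/// A positioned piece of text. Hypothetical type: the field names and the
/// top-down coordinate convention (y grows downward) are assumptions.
#[derive(Debug, Clone, PartialEq)]
pub struct TextBlock {
    pub x: f32,
    pub y: f32,
    pub text: String,
}

/// Group blocks into lines. A block joins the current line when its `y`
/// is within `tolerance` of the first block on that line.
pub fn group_into_lines(mut blocks: Vec<TextBlock>, tolerance: f32) -> Vec<Vec<TextBlock>> {
    // Sort by vertical position so that each line is contiguous.
    blocks.sort_by(|a, b| a.y.total_cmp(&b.y));

    let mut lines: Vec<Vec<TextBlock>> = Vec::new();
    for block in blocks {
        let starts_new_line = match lines.last() {
            Some(line) => (block.y - line[0].y).abs() > tolerance,
            None => true,
        };
        if starts_new_line {
            lines.push(vec![block]);
        } else if let Some(line) = lines.last_mut() {
            line.push(block);
        }
    }

    // Order the blocks within each line from left to right.
    for line in &mut lines {
        line.sort_by(|a, b| a.x.total_cmp(&b.x));
    }
    lines
}
```

Comparing against the first block of a line, rather than the previous block, keeps a slowly drifting baseline from merging distinct lines. Paragraph and table detection would build on the resulting lines.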
Structs
- DocxImage - An image to embed into the resulting DOCX document.
- Paragraph - A paragraph composed of one or more Runs.
- Run - A run of text with consistent formatting; the smallest unit emitted to the resulting DOCX.
- Table - A table reconstructed from text blocks aligned in columns.
Enums
- PageElement - One element in the per-page layout: a paragraph, a table, or an image.
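To make the relationship between these items concrete, here is a minimal sketch of how such a data model could be laid out. The type names come from the listing above, but every field, the variant names, and the `plain_text` helper are assumptions made for illustration, not the crate's actual definitions.

```rust
// All fields below are hypothetical; only the type names appear in the docs.

/// A run of text with consistent formatting.
pub struct Run {
    pub text: String,
    pub bold: bool,
    pub italic: bool,
}

/// A paragraph composed of one or more runs.
pub struct Paragraph {
    pub runs: Vec<Run>,
}

/// A table, modeled here as rows of cell strings.
pub struct Table {
    pub rows: Vec<Vec<String>>,
}

/// An image, modeled here as raw encoded bytes.
pub struct DocxImage {
    pub data: Vec<u8>,
}

/// One element in the per-page layout.
pub enum PageElement {
    Paragraph(Paragraph),
    Table(Table),
    Image(DocxImage),
}

/// Hypothetical helper: flatten an element to plain text.
/// Table cells are joined with tabs and rows with newlines; images yield "".
pub fn plain_text(element: &PageElement) -> String {
    match element {
        PageElement::Paragraph(p) => p.runs.iter().map(|r| r.text.as_str()).collect(),
        PageElement::Table(t) => t
            .rows
            .iter()
            .map(|row| row.join("\t"))
            .collect::<Vec<_>>()
            .join("\n"),
        PageElement::Image(_) => String::new(),
    }
}
```

Modeling the page as a flat sequence of enum values lets a consumer handle each kind of element with a single `match`.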
Functions
- analyze_page - Analyze text blocks from a page and group them into paragraphs and tables.
- map_font_name - Detect bold/italic from a PDF font name.
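Detecting style from a font name usually comes down to substring heuristics. The sketch below shows one way this might work; `FontStyle` and `detect_style` are hypothetical names for this example, and the keywords checked are assumptions rather than the crate's actual rules. It relies on the PDF convention that subset fonts carry a tag of six uppercase letters followed by `+`.

```rust
/// Style flags inferred from a font name (hypothetical type for this sketch).
#[derive(Debug, Default, Clone, Copy, PartialEq, Eq)]
pub struct FontStyle {
    pub bold: bool,
    pub italic: bool,
}

/// Guess bold/italic from a PDF font name such as
/// "ABCDEF+Helvetica-BoldOblique". Heuristic only: the keyword list is an
/// assumption, not the crate's actual rule set.
pub fn detect_style(font_name: &str) -> FontStyle {
    // Drop a subset tag: exactly six uppercase ASCII letters followed by '+'.
    let base = match font_name.split_once('+') {
        Some((tag, rest))
            if tag.len() == 6 && tag.chars().all(|c| c.is_ascii_uppercase()) =>
        {
            rest
        }
        _ => font_name,
    };

    // Match keywords case-insensitively.
    let lower = base.to_ascii_lowercase();
    FontStyle {
        bold: lower.contains("bold"),
        italic: lower.contains("italic") || lower.contains("oblique"),
    }
}
```

Stripping the subset tag first avoids accidental keyword matches inside the random prefix. Names that encode weight differently (for example "Black" or "Heavy") would need additional keywords.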