Crate pdf_docx

PDF to DOCX conversion with text, tables, and images.

Extracts text blocks, images, and spatial layout from PDF documents and produces valid OOXML (.docx) files.

Re-exports

pub use error::DocxError;
pub use error::Result;
pub use layout::DocxImage;
pub use layout::PageElement;
pub use layout::Paragraph;
pub use layout::Run;
pub use layout::Table;

Modules

error
Error types for PDF → DOCX conversion.
layout
Spatial grouping of text blocks into lines, paragraphs, and tables.
writer
DOCX OOXML writer using quick-xml and zip.
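
The layout module's description, grouping text blocks into lines, can be illustrated with a minimal sketch. This is not the crate's implementation: `TextBlock`, `group_into_lines`, `line_text`, and the `tolerance` parameter are hypothetical names introduced here, and the baseline-proximity rule is only one plausible way to form lines.

```rust
/// Hypothetical text block (NOT a type from pdf_docx): a piece of text
/// with the position of its left edge and baseline in PDF page units.
#[derive(Debug, Clone, PartialEq)]
struct TextBlock {
    text: String,
    x: f32, // left edge
    y: f32, // baseline; PDF y grows upward
}

/// Group blocks into lines (assumed rule): a block joins the current line
/// when its baseline is within `tolerance` of the line's first block.
fn group_into_lines(mut blocks: Vec<TextBlock>, tolerance: f32) -> Vec<Vec<TextBlock>> {
    // Sort by descending y so lines come out in top-to-bottom order.
    blocks.sort_by(|a, b| b.y.total_cmp(&a.y));

    let mut lines: Vec<Vec<TextBlock>> = Vec::new();
    for block in blocks {
        // Compare against the first block of the most recent line, if any.
        let same_line = lines
            .last()
            .map_or(false, |line| (line[0].y - block.y).abs() <= tolerance);
        if same_line {
            if let Some(line) = lines.last_mut() {
                line.push(block);
            }
        } else {
            lines.push(vec![block]);
        }
    }

    // Within each line, order blocks left to right.
    for line in &mut lines {
        line.sort_by(|a, b| a.x.total_cmp(&b.x));
    }
    lines
}

/// Join the text of one line with single spaces.
fn line_text(line: &[TextBlock]) -> String {
    line.iter()
        .map(|b| b.text.as_str())
        .collect::<Vec<_>>()
        .join(" ")
}
```

Calling `group_into_lines(blocks, 2.0)` on blocks extracted from one page returns the lines from top to bottom, and `line_text` turns each line into a string. Paragraph and table detection would need further heuristics, such as line spacing and column alignment, which this sketch does not attempt.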

Functions

convert_pdf_bytes_to_docx
Convert a PDF file (bytes) to DOCX format.
pdf_to_docx
Convert a PDF document to DOCX format.
pdf_to_docx_text_only
Convert a PDF document to DOCX format, text only (no images).