Expand description
Core OOXML extraction APIs for oxdoc.
This crate reads Office Open XML packages without rendering them. It exposes path-based helpers for DOCX/PPTX text extraction, XLSX-to-CSV extraction, and package metadata. Extraction returns useful output plus recoverable warnings, while unrecoverable package and parser failures are returned as typed errors.
let extraction = oxdoc_core::extract_docx_text("contract.docx")?;
println!("{}", extraction.value);The public API follows semantic versioning from 1.0 onward.
Re-exports§
pub use models::AuditSignal;pub use models::DocumentAudit;pub use models::DocumentInfo;pub use models::DocumentType;pub use models::Extraction;pub use models::OutputWarning;pub use models::StructuredText;pub use models::TextBlock;pub use models::XlsxCsvOptions;pub use models::XlsxSheet;pub use models::XlsxSheetVisibility;pub use models::XlsxValueMode;
Modules§
Enums§
Functions§
- detect_
document_ type - detect_
document_ type_ from_ reader - extract_
docx_ structured_ text - extract_
docx_ structured_ text_ from_ reader - extract_
docx_ text - extract_
docx_ text_ from_ reader - extract_
pptx_ structured_ text - extract_
pptx_ structured_ text_ from_ reader - extract_
pptx_ text - extract_
pptx_ text_ from_ reader - extract_
xlsx_ csv - extract_
xlsx_ csv_ from_ reader - extract_
xlsx_ csv_ from_ reader_ with_ value_ mode - extract_
xlsx_ csv_ with_ value_ mode - list_
xlsx_ sheets - list_
xlsx_ sheets_ from_ reader - list_
xlsx_ sheets_ from_ reader_ with_ hidden - list_
xlsx_ sheets_ with_ hidden - read_
audit - read_
audit_ from_ reader - read_
info - read_
info_ from_ reader