pdf_oxide 0.2.2

Production-grade PDF parsing: spec-compliant text extraction, intelligent reading order, OCR support. 47.9× faster than PyMuPDF4LLM.

pdf_oxide

There is very little structured metadata to build this page from currently. You should check the main library docs, readme, or Cargo.toml in case the author documented the features in them.

This version has 11 feature flags, 0 of them enabled by default.

default

This feature flag does not enable additional features.

debug-span-merging

This feature flag does not enable additional features.

gpu

logging

This feature flag does not enable additional features.

ml

ocr

pyo3

python

table-ml

tesseract-rs

wasm

wasm-ml