The fastest Rust PDF library with text extraction: 0.8ms mean, 100% pass rate on 3,830 PDFs. 5× faster than pdf_extract, 17× faster than oxidize_pdf. Extract, create, and edit PDFs.
#ML Training
Scripts for training and fine-tuning ML models.
##Structure-`dataset/` - Dataset preparation tools
-`output/` - Training outputs (checkpoints, logs)
-`*.py` - Training scripts
For OCR documentation, see the OCR module documentation.