# pdf-ocr
OCR integration for PDFluent — Tesseract and PaddleOCR backends.
This crate is part of the [PDFluent](https://pdfluent.com) commercial Rust PDF SDK.
**Free for evaluation. Production use requires a valid license.**
## What it does
Extracts text from scanned/raster PDF pages using OCR. Two pluggable backends:
- **Tesseract** — broadly used, good general accuracy, mature
- **PaddleOCR** — better for non-Latin scripts (CJK, Arabic, etc.)
## Status
Beta. Backend-dependent quality. Not yet broadly tested at scale.
## Usage
Most users do not depend on this crate directly. Use the [`pdfluent`](https://crates.io/crates/pdfluent) facade with `ocr-tesseract` or `ocr-paddle`:
```rust
use pdfluent::prelude::*;
```
For low-level access, see <https://pdfluent.com/docs>.
## Licensing
- Free for evaluation, development, and testing
- Production use requires a valid PDFluent commercial license
- Redistribution requires the OEM Redistribution add-on
See [LICENSE](LICENSE) for full terms, or visit <https://pdfluent.com/terms>.
## Links
- Main crate: <https://crates.io/crates/pdfluent>
- Documentation: <https://pdfluent.com/docs>
- Trial: <https://pdfluent.com/trial>
- Pricing: <https://pdfluent.com/pricing>