Skip to main content

Module extract

Module extract 

Source
Expand description

Input extraction backends.

Each submodule turns a specific input format (PDF, future: Word, HTML) into a PNG image ready for OCR.

Modules§

pdfpdf-input
PDF text extraction and rasterization via pdfium-render.

Enums§

InputKind
Source kind detected for an input path.