Expand description
§tesseract-ocr-static-c
This crate bundles Tesseract OCR and Leptonica libraries. These two libraries are built together with Musl libc and LLVM libcxx and linked statically. The build should be reproducible since the versions of all libraries are pinned. Since there are no dependencies one needs to supply images in raw RGB/RGBA/grayscale format to Tesseract.
The build should work with both dynamically and statically linked C libraries,
i.e. *-gnu and *-musl targets.
Required CLI tools: cmake, make, git, python3, curl, tar, zstd.
Required compiler: Clang 20+.
§Environment variables
The following environment variables affect the build process.
| Variable | Default value | Comment |
|---|---|---|
PATH | Executable search path | |
TESSERACT_CC | clang | C compiler |
TESSERACT_CXX | clang++ | C++ compiler |
TESSERACT_AR | llvm-ar | |
TESSERACT_RANLIB | llvm-ranlib | |
TESSERACT_CFLAGS | -O3 | C compiler flags |
TESSERACT_CXXFLAGS | -O3 | C++ compiler flags |
TESSERACT_LDFLAGS | Linker flags | |
TESSERACT_BUILD_FROM_SOURCE | If set, Tesseract OCR is built from source; otherwise an attempt is made to download pre-built binary. If the attempt fails, it is built from source. | |
TESSERACT_PRE_BUILT_ARCHIVE_URL | Override URL from which pre-built binary is downloaded. Normally you should have a different URL for each Rust target. | |
TESSERACT_PRE_BUILT_ARCHIVE_HASH | BLAKE2b hash of the pre-built binary archive. Must be set if you’ve overriden hard-coded archive URLs. Can be computed with b2sum CLI tool. |
§High-level interface
The following crate provides ergonomic Rust interface:
tesseract-ocr-static.
Structs§
Constants§
- Tess
OcrEngine Mode_ OEM_ DEFAULT - Tess
OcrEngine Mode_ OEM_ LSTM_ ONLY - Tess
OcrEngine Mode_ OEM_ TESSERACT_ LSTM_ COMBINED - Tess
OcrEngine Mode_ OEM_ TESSERACT_ ONLY - Tess
Orientation_ ORIENTATION_ PAGE_ DOWN - Tess
Orientation_ ORIENTATION_ PAGE_ LEFT - Tess
Orientation_ ORIENTATION_ PAGE_ RIGHT - Tess
Orientation_ ORIENTATION_ PAGE_ UP - Tess
Page Iterator Level_ RIL_ BLOCK - Tess
Page Iterator Level_ RIL_ PARA - Tess
Page Iterator Level_ RIL_ SYMBOL - Tess
Page Iterator Level_ RIL_ TEXTLINE - Tess
Page Iterator Level_ RIL_ WORD - Tess
Page SegMode_ PSM_ AUTO - Tess
Page SegMode_ PSM_ AUTO_ ONLY - Tess
Page SegMode_ PSM_ AUTO_ OSD - Tess
Page SegMode_ PSM_ CIRCLE_ WORD - Tess
Page SegMode_ PSM_ OSD_ ONLY - Tess
Page SegMode_ PSM_ RAW_ LINE - Tess
Page SegMode_ PSM_ SINGLE_ BLOCK - Tess
Page SegMode_ PSM_ SINGLE_ BLOCK_ VERT_ TEXT - Tess
Page SegMode_ PSM_ SINGLE_ CHAR - Tess
Page SegMode_ PSM_ SINGLE_ COLUMN - Tess
Page SegMode_ PSM_ SINGLE_ LINE - Tess
Page SegMode_ PSM_ SINGLE_ WORD - Tess
Page SegMode_ PSM_ SPARSE_ TEXT - Tess
Page SegMode_ PSM_ SPARSE_ TEXT_ OSD - Tess
Paragraph Justification_ JUSTIFICATION_ CENTER - Tess
Paragraph Justification_ JUSTIFICATION_ LEFT - Tess
Paragraph Justification_ JUSTIFICATION_ RIGHT - Tess
Paragraph Justification_ JUSTIFICATION_ UNKNOWN - Tess
Poly Block Type_ PT_ CAPTION_ TEXT - Tess
Poly Block Type_ PT_ EQUATION - Tess
Poly Block Type_ PT_ FLOWING_ IMAGE - Tess
Poly Block Type_ PT_ FLOWING_ TEXT - Tess
Poly Block Type_ PT_ HEADING_ IMAGE - Tess
Poly Block Type_ PT_ HEADING_ TEXT - Tess
Poly Block Type_ PT_ HORZ_ LINE - Tess
Poly Block Type_ PT_ INLINE_ EQUATION - Tess
Poly Block Type_ PT_ NOISE - Tess
Poly Block Type_ PT_ PULLOUT_ IMAGE - Tess
Poly Block Type_ PT_ PULLOUT_ TEXT - Tess
Poly Block Type_ PT_ TABLE - Tess
Poly Block Type_ PT_ UNKNOWN - Tess
Poly Block Type_ PT_ VERTICAL_ TEXT - Tess
Poly Block Type_ PT_ VERT_ LINE - Tess
Textline Order_ TEXTLINE_ ORDER_ LEFT_ TO_ RIGHT - Tess
Textline Order_ TEXTLINE_ ORDER_ RIGHT_ TO_ LEFT - Tess
Textline Order_ TEXTLINE_ ORDER_ TOP_ TO_ BOTTOM - Tess
Writing Direction_ WRITING_ DIRECTION_ LEFT_ TO_ RIGHT - Tess
Writing Direction_ WRITING_ DIRECTION_ RIGHT_ TO_ LEFT - Tess
Writing Direction_ WRITING_ DIRECTION_ TOP_ TO_ BOTTOM
Functions§
- Tess
BaseAPI ⚠Analyse Layout - Tess
BaseAPI ⚠Clear - Tess
BaseAPI ⚠Clear Adaptive Classifier - Tess
BaseAPI ⚠Clear Persistent Cache - Tess
BaseAPI ⚠Create - Tess
BaseAPI ⚠Delete - Tess
BaseAPI ⚠End - Tess
BaseAPI ⚠GetAlto Text - Tess
BaseAPI ⚠GetBool Variable - Tess
BaseAPI ⚠GetBox Text - Tess
BaseAPI ⚠GetDatapath - Tess
BaseAPI ⚠GetDouble Variable - Tess
BaseAPI ⚠GetGradient - Tess
BaseAPI ⚠GetHOCR Text - Tess
BaseAPI ⚠GetInt Variable - Tess
BaseAPI ⚠GetIterator - Tess
BaseAPI ⚠GetLSTM BoxText - Tess
BaseAPI ⚠GetPAGE Text - Tess
BaseAPI ⚠GetPage SegMode - Tess
BaseAPI ⚠GetString Variable - Tess
BaseAPI ⚠GetText Direction - Tess
BaseAPI ⚠GetThresholded Image - Tess
BaseAPI ⚠GetThresholded Image Scale Factor - Tess
BaseAPI ⚠GetTsv Text - Tess
BaseAPI ⚠GetUNLV Text - Tess
BaseAPI ⚠GetUT F8Text - Tess
BaseAPI ⚠GetWord StrBox Text - Tess
BaseAPI ⚠Init2 - Tess
BaseAPI ⚠Init ForAnalyse Page - Tess
BaseAPI ⚠IsValid Word - Tess
BaseAPI ⚠Oem - Tess
BaseAPI ⚠Print Variables ToFile - Tess
BaseAPI ⚠Recognize - Tess
BaseAPI ⚠SetDebug Variable - Tess
BaseAPI ⚠SetImage2 - Tess
BaseAPI ⚠SetMin Orientation Margin - Tess
BaseAPI ⚠SetPage SegMode - Tess
BaseAPI ⚠SetRectangle - Tess
BaseAPI ⚠SetSource Resolution - Tess
BaseAPI ⚠SetVariable - Tess
Choice ⚠Iterator Confidence - Tess
Choice ⚠Iterator Delete - Tess
Choice ⚠Iterator GetUT F8Text - Tess
Choice ⚠Iterator Next - Tess
Delete ⚠Text - Tess
Monitor ⚠Create - Tess
Monitor ⚠Delete - Tess
Monitor ⚠GetProgress - Tess
Monitor ⚠SetCancel Func - Tess
Monitor ⚠SetCancel This - Tess
Monitor ⚠SetDeadlineM Secs - Tess
Monitor ⚠SetProgress Func - Tess
Page ⚠Iterator Baseline - Tess
Page ⚠Iterator Begin - Tess
Page ⚠Iterator Block Type - Tess
Page ⚠Iterator Bounding Box - Tess
Page ⚠Iterator Copy - Tess
Page ⚠Iterator Delete - Tess
Page ⚠Iterator GetBinary Image - Tess
Page ⚠Iterator GetImage - Tess
Page ⚠Iterator IsAt Beginning Of - Tess
Page ⚠Iterator IsAt Final Element - Tess
Page ⚠Iterator Next - Tess
Page ⚠Iterator Orientation - Tess
Page ⚠Iterator Paragraph Info - Tess
Result ⚠Iterator Confidence - Tess
Result ⚠Iterator Copy - Tess
Result ⚠Iterator Delete - Tess
Result ⚠Iterator GetChoice Iterator - Tess
Result ⚠Iterator GetPage Iterator - Tess
Result ⚠Iterator GetUT F8Text - Tess
Result ⚠Iterator Next - Tess
Result ⚠Iterator Symbol IsDropcap - Tess
Result ⚠Iterator Symbol IsSubscript - Tess
Result ⚠Iterator Symbol IsSuperscript - Tess
Result ⚠Iterator Word Font Attributes - Tess
Result ⚠Iterator Word IsFrom Dictionary - Tess
Result ⚠Iterator Word IsNumeric - Tess
Result ⚠Iterator Word Recognition Language - Tess
Version ⚠ - getLeptonica
Version ⚠ - pixClone⚠
- pixCreate⚠
- pixDestroy⚠
- pixGet
Data ⚠ - pixGet
Dimensions ⚠ - pixGet
Height ⚠ - pixGet
Width ⚠ - pixGet
Wpl ⚠