PDF Process
A Rust library for processing PDF files. It wraps the CLI utilities provided by Poppler, specifically pdftotext (text extraction), pdftocairo (image rendering), and pdfinfo (extracting basic details).
Provides functionality for:
- Extracting PDF text contents
- Rendering PDF files to images (PNG/JPEG/TIFF)
- Reading basic PDF details (encryption, page count, subject, title, creator, author, etc.)
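This README does not show the crate's own API, so the sketch below does not use it. It only illustrates the general approach the library is described as taking: running a Poppler utility (here pdfinfo) through `std::process::Command` and parsing its `Key: value` output. The helper names `parse_pdfinfo` and `pdf_info` and the path `example.pdf` are hypothetical and chosen for illustration.

```rust
use std::collections::HashMap;
use std::io;
use std::process::Command;

/// Parse the `Key: value` lines printed by `pdfinfo` into a map.
/// Hypothetical helper for illustration; not part of this crate's API.
fn parse_pdfinfo(output: &str) -> HashMap<String, String> {
    output
        .lines()
        // Split on the first colon only, so values that contain colons
        // (for example timestamps) are kept whole.
        .filter_map(|line| line.split_once(':'))
        .map(|(key, value)| (key.trim().to_string(), value.trim().to_string()))
        .collect()
}

/// Run `pdfinfo <path>` and return the parsed fields.
/// Assumes `pdfinfo` is on the PATH.
fn pdf_info(path: &str) -> io::Result<HashMap<String, String>> {
    let output = Command::new("pdfinfo").arg(path).output()?;
    if !output.status.success() {
        let message = String::from_utf8_lossy(&output.stderr).into_owned();
        return Err(io::Error::new(io::ErrorKind::Other, message));
    }
    Ok(parse_pdfinfo(&String::from_utf8_lossy(&output.stdout)))
}

fn main() {
    // "example.pdf" is a placeholder path.
    match pdf_info("example.pdf") {
        Ok(info) => {
            println!("Pages: {:?}", info.get("Pages"));
            println!("Encrypted: {:?}", info.get("Encrypted"));
        }
        Err(err) => eprintln!("pdfinfo failed: {err}"),
    }
}
```

Keeping the parsing in a separate pure function means it can be checked without Poppler installed; only `pdf_info` touches the external process.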
Prerequisites
The library is developed against a Linux host. Windows is not supported.
Requires Poppler to be installed on your system and its utilities to be on your PATH. Many
distributions come with it pre-installed. You can check whether it is installed by running pdfinfo -v, which should
print a version banner beginning with a line similar to:
pdfinfo version 24.02.0
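The same check can be done from Rust before calling into the utilities. The sketch below is an assumption-laden illustration, not part of this crate: `parse_version` and `poppler_version` are hypothetical helper names, and the code reads both stderr and stdout because the stream the banner is written to may vary.

```rust
use std::process::Command;

/// Extract the version from a banner line such as "pdfinfo version 24.02.0".
/// Hypothetical helper for illustration; not part of this crate's API.
fn parse_version(banner: &str) -> Option<String> {
    banner
        .lines()
        .find_map(|line| line.trim().strip_prefix("pdfinfo version "))
        .map(|version| version.trim().to_string())
}

/// Return the installed pdfinfo version, or None if pdfinfo cannot be
/// started or its output does not contain a recognisable banner.
fn poppler_version() -> Option<String> {
    let output = Command::new("pdfinfo").arg("-v").output().ok()?;
    // Check both streams, since the banner may be written to either.
    let mut text = String::from_utf8_lossy(&output.stderr).into_owned();
    text.push_str(&String::from_utf8_lossy(&output.stdout));
    parse_version(&text)
}

fn main() {
    match poppler_version() {
        Some(version) => println!("Found pdfinfo version {version}"),
        None => eprintln!("pdfinfo was not found on the PATH; install Poppler first"),
    }
}
```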
Otherwise you can install it with one of the commands below:
Fedora:
Adjust the command above for your specific Linux distribution.
Installation
Install with cargo:
Or add the following to the [dependencies] section of your Cargo.toml:
= "0.1.0"
Tested
Tested against:
- pdftotext version 24.02.0
- pdftocairo version 24.02.0
- pdfinfo version 24.02.0