pdf-extract
A rust library to extract content from PDF files.
let bytes = read.unwrap;
let out = extract_text_from_mem.unwrap;
assert!;
See also
- https://github.com/elacin/PDFExtract/
- https://github.com/euske/pdfminer
- https://github.com/CrossRef/pdfextract
- https://github.com/VikParuchuri/marker
- https://github.com/kermitt2/pdfalto used by grobid