## pdf-extract
[](https://github.com/jrmuizel/pdf-extract/actions)
[](https://crates.io/crates/pdf-extract)
[](https://docs.rs/pdf-extract)
A rust library to extract content from PDF files.
```rust
let bytes = std::fs::read("tests/docs/simple.pdf").unwrap();
let out = pdf_extract::extract_text_from_mem(&bytes).unwrap();
assert!(out.contains("This is a small demonstration"));
```
## See also
- https://github.com/elacin/PDFExtract/
- https://github.com/euske/pdfminer
- https://github.com/CrossRef/pdfextract
- https://github.com/VikParuchuri/marker
- https://github.com/kermitt2/pdfalto used by [grobid](https://github.com/kermitt2/grobid/)
### Not PDF specific
- https://github.com/Layout-Parser/layout-parser