See also: https://github.com/elacin/PDFExtract/ https://github.com/euske/pdfminer https://github.com/CrossRef/pdfextract