Structs
- WorkTextResult - Result of extracting text from a work’s PDF.
- ZoteroItemInfo - Brief Zotero library info for a work matched by DOI.
Enums
- PdfSource - Where the PDF was obtained from.
- ProcessingMode
- WorkTextError - Errors from the work_text pipeline.
Functions
- extract_text_bytes - Extract text from PDF bytes using pdf-extract.
- find_work_in_zotero - Check if a work exists in the Zotero library, matched by DOI.
- poll_zotero_for_work - Poll Zotero for a work by DOI. Waits 5s initially, then polls every 2s for up to ~2 min.
- try_zotero - Try to find and download a PDF from Zotero (local storage first, then remote API).
- work_text - Download and extract the full text of a scholarly work.
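
The polling schedule described for poll_zotero_for_work (an initial 5s wait, then a check every 2s for roughly 2 minutes) could be sketched as below. This is only an illustration of the timing, not the crate's implementation: the real function's signature is not shown in this listing, and `poll_until`, the three constants, and the injected `sleep` callback are hypothetical names. The "~2 min" limit is assumed to mean a 120s total budget, and the sketch is synchronous even though the real function may well be async.

```rust
use std::time::Duration;

// Hypothetical constants taken from the description of poll_zotero_for_work.
const INITIAL_DELAY: Duration = Duration::from_secs(5);
const POLL_INTERVAL: Duration = Duration::from_secs(2);
// Assumption: "~2 min" is read as a 120-second total budget.
const MAX_WAIT: Duration = Duration::from_secs(120);

/// Waits `INITIAL_DELAY`, then calls `check` every `POLL_INTERVAL` until it
/// returns `Some` or another wait would exceed `MAX_WAIT`.
///
/// `check` stands in for a lookup such as a DOI match in the library.
/// `sleep` is injected so the schedule can be exercised without real waiting;
/// production code could pass `std::thread::sleep`.
fn poll_until<T>(
    mut check: impl FnMut() -> Option<T>,
    mut sleep: impl FnMut(Duration),
) -> Option<T> {
    sleep(INITIAL_DELAY);
    let mut waited = INITIAL_DELAY;
    loop {
        if let Some(found) = check() {
            return Some(found);
        }
        // Stop when one more interval would go past the time budget.
        if waited + POLL_INTERVAL > MAX_WAIT {
            return None;
        }
        sleep(POLL_INTERVAL);
        waited += POLL_INTERVAL;
    }
}
```

Passing `std::thread::sleep` as the second argument gives real delays. Injecting the sleep function keeps the timing logic testable and makes the give-up condition explicit.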