Skip to main content

Module document

Module document 

Source
Expand description

Document inspection (v2).

Future capabilities:

  • Detect document format (PDF, DOCX, TXT, HTML, Markdown)
  • Extract page count, word count, structure
  • Estimate chunking strategy and token cost

Functions§

inspect_bytes
Inspect document bytes and extract metadata.