Expand description
Canonical document model (urn:ethos:schema:document:1). Field-for-field mirror of
the schema; serialization through these types + crate::c14n is the only way Ethos
emits the document artifact.
Structs§
- Cell
- A table cell.
- Chunk
- A RAG chunk.
- Coordinate
System - Coordinate system declaration.
- Document
- Top-level document artifact (
ethos.json). - Element
- A layout element.
- Page
- A page.
- PageBox
- A page-anchored bbox (citation target).
- Parser
Info - Producing parser identity.
- Payload
- The emitted document payload.
- Profile
Ref - Deterministic profile reference.
- Region
- A non-text region with stable coordinates.
- Source
Info - Source identity.
- Span
- An extracted text span.
- Span
Origin Locator - Origin-derived text locator that remains stable when PDFium bbox dimensions drift.
- Table
- A table.
- Table
Exports - Optional derived table exports.
- Token
Estimate - Token estimate.
- Warning
- A stable, deterministic warning (fixed-template message).
Enums§
- Element
Type - Element type enum (wire: snake_case).
- Region
Kind - Non-text region kind. Base tier emits
unknownunless deterministic gates are met.