Expand description
Core data model for fleischwolf.
This crate is the Rust counterpart of the docling-core Python package: it
owns the unified DoclingDocument representation that every backend
produces and every serializer consumes. Keeping it dependency-light and
separate from the conversion logic mirrors the Python split between
docling-core (the schema) and docling (the converters).
Phase 0 models a simplified, linear node tree that is enough to round-trip
through Markdown. The faithful, $ref-based schema that matches
docling-core’s JSON wire format lands in Phase 1 (see MIGRATION.md).
Modules§
- base64
- Minimal standard-alphabet Base64 codec (RFC 4648):
encodefor embedding image bytes asdata:URIs,decodefor reading them back out — avoids a dependency for the two things we need.
Structs§
- Docling
Document - The unified, format-agnostic document produced by every backend.
- Picture
Image - An extracted picture’s raw encoded bytes plus its mimetype and pixel size —
the fleischwolf analogue of docling-core’s
ImageRef. - Table
- A simple row-major table.
rows[0]is the header row.
Enums§
- DocItem
Label - Semantic role of a document item, mirroring docling-core’s
DocItemLabel. - Image
Mode - How pictures are rendered (mirrors docling-core’s
ImageRefMode). - Node
- A single piece of document content.