Expand description
Arrow schemas for the graph store.
Three foundational tables:
- Triples: subject/predicate/object quads with provenance
- Embeddings: entity vectors (
FixedSizeList<f32>) - Metadata: per-entity access tracking
The triples schema includes an optional layer column (UInt8) for
sub-partitioning within namespaces. When layers are not needed,
the column is still present but set to 0.
Modules§
- chunk_
col - Named column indices for the Chunks schema (fine-grained provenance).
- col
- Named column indices for the Triples schema. Use these instead of hardcoded integers when accessing RecordBatch columns.
Constants§
- CHUNKS_
SCHEMA_ VERSION - Current schema version for the Chunks table.
- DEFAULT_
EMBEDDING_ DIM - Default embedding dimension (all-MiniLM-L6-v2 = 384, but 768 is future-proof).
- TRIPLES_
SCHEMA_ VERSION - Current schema version for the Triples table.
Functions§
- chunks_
schema - Schema for the Chunks table — fine-grained document provenance.
- embeddings_
schema - Schema for the Embeddings table — vector representations of entities.
- embeddings_
schema_ with_ dim - Embeddings schema with a custom vector dimension.
- metadata_
schema - Schema for the Metadata table — per-entity access tracking.
- normalize_
to_ current - Normalize a RecordBatch from an older schema version to the current version.
- triples_
schema - Schema for the Triples table — the core knowledge representation.