gradatum-engine
On-device inference adapter: provides Chat, Embedder, and Reranker trait implementations backed by candle or llama.cpp. Optional via the engine-local feature gate.
Status : Alpha — placeholder v0.0.1. Source code private until v1.0 public release. See gradatum.org for project context.
Part of gradatum — Memory backbone for AI agents.
Public API (Phase 1+)
Feature flags
| Feature | Description | Default |
|---|---|---|
| engine-local | Enables local inference backends (candle / llama.cpp) | disabled |
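As a rough illustration of how a disabled-by-default feature gate like this usually behaves, the sketch below uses Rust's standard `cfg` mechanisms. The names `BACKENDS`, `local_inference_enabled`, and `available_backends` are hypothetical and are not part of the published crate; only the feature name `engine-local` comes from the table above.

```rust
// Hypothetical sketch: item names are illustrative, not the crate's real API.

/// Backends compiled in when the `engine-local` feature is enabled.
#[cfg(feature = "engine-local")]
const BACKENDS: &[&str] = &["candle", "llama.cpp"];

/// With the feature disabled (the default), no local backend is compiled in.
#[cfg(not(feature = "engine-local"))]
const BACKENDS: &[&str] = &[];

/// Returns true when the crate was built with `engine-local`.
pub fn local_inference_enabled() -> bool {
    cfg!(feature = "engine-local")
}

/// Lists the local backends available in this build; empty when the feature is off.
pub fn available_backends() -> &'static [&'static str] {
    BACKENDS
}
```

Because the selection happens at compile time, a build without the feature would carry no local inference code at all, which is presumably the point of keeping it off by default.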
Implementations (feature = engine-local)
/// Chat implementation backed by local llama.cpp model.
/// Embedder implementation backed by local candle model (bge-small-en-v1.5).
/// Reranker implementation backed by local cross-encoder model.
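Since the source is not yet public, the following is only a guess at what implementing such traits might look like. The trait signatures, `ToyEmbedder`, and `ToyReranker` are all assumptions made for illustration. The toy embedder is a byte histogram standing in for a candle model, and the toy reranker scores by embedding dot product, which is a bi-encoder shortcut and not a real cross-encoder.

```rust
/// Hypothetical trait shape; the real `Embedder` signature is not published.
pub trait Embedder {
    fn embed(&self, text: &str) -> Vec<f32>;
}

/// Hypothetical trait shape; the real `Reranker` signature is not published.
pub trait Reranker {
    /// Returns candidate indices ordered from most to least relevant.
    fn rerank(&self, query: &str, candidates: &[&str]) -> Vec<usize>;
}

/// Toy stand-in for a model-backed embedder: an L2-normalised byte histogram.
pub struct ToyEmbedder {
    pub dims: usize,
}

impl Embedder for ToyEmbedder {
    fn embed(&self, text: &str) -> Vec<f32> {
        if self.dims == 0 {
            return Vec::new();
        }
        let mut v = vec![0.0f32; self.dims];
        for b in text.bytes() {
            v[b as usize % self.dims] += 1.0;
        }
        // Normalise to unit length so dot products behave like cosine similarity.
        let norm = v.iter().map(|x| x * x).sum::<f32>().sqrt();
        if norm > 0.0 {
            for x in v.iter_mut() {
                *x /= norm;
            }
        }
        v
    }
}

/// Toy reranker: orders candidates by dot product with the query embedding.
pub struct ToyReranker<E: Embedder> {
    pub embedder: E,
}

impl<E: Embedder> Reranker for ToyReranker<E> {
    fn rerank(&self, query: &str, candidates: &[&str]) -> Vec<usize> {
        let q = self.embedder.embed(query);
        let mut scored: Vec<(usize, f32)> = candidates
            .iter()
            .enumerate()
            .map(|(i, c)| {
                let e = self.embedder.embed(c);
                let score: f32 = q.iter().zip(e.iter()).map(|(a, b)| a * b).sum();
                (i, score)
            })
            .collect();
        // Highest score first.
        scored.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap_or(std::cmp::Ordering::Equal));
        scored.into_iter().map(|(i, _)| i).collect()
    }
}
```

Callers written against the traits would not need to change if a toy implementation were swapped for a model-backed one, which is the usual motivation for this kind of adapter crate.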
Anti-cycle invariant
gradatum-engine MAY depend on gradatum-chat and gradatum-embed.
gradatum-chat and gradatum-embed MUST NOT depend on gradatum-engine.
Composition happens at binary level (gradatum-server, gradatum-worker).
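The dependency direction described above can be sketched in a single file, with one module standing in for each crate. The `Chat` method `complete`, the `LocalChat` type, and `build_chat` are hypothetical names chosen for illustration; only the crate names and the direction of the dependency come from this README.

```rust
// Each module stands in for a separate crate in the workspace.

/// Stand-in for gradatum-chat: defines the trait and knows nothing about the engine.
pub mod gradatum_chat {
    pub trait Chat {
        // Hypothetical method; the real trait is not published.
        fn complete(&self, prompt: &str) -> String;
    }
}

/// Stand-in for gradatum-engine: allowed to depend on gradatum-chat.
pub mod gradatum_engine {
    use super::gradatum_chat::Chat;

    /// Hypothetical local implementation of the trait.
    pub struct LocalChat;

    impl Chat for LocalChat {
        fn complete(&self, prompt: &str) -> String {
            format!("[local] {}", prompt)
        }
    }
}

/// Stand-in for binary-level composition (gradatum-server, gradatum-worker):
/// the only place that names both the trait and the concrete engine type.
pub fn build_chat() -> Box<dyn gradatum_chat::Chat> {
    Box::new(gradatum_engine::LocalChat)
}
```

With real crates, Cargo would reject a dependency from gradatum-chat back to gradatum-engine as a cycle, so keeping the wiring in the binary crates is what lets the trait crates stay free of any backend.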
Documentation
- Project : https://gradatum.org
- Source : private until v1.0
- Roadmap : Phase 1+ (candle CPU benchmarked at 17ms/embed on a modern CPU)
- License : Apache-2.0