atomr-infer-runtime-mistralrs
Thin wrapper over
mistralrs.
Why this exists alongside Candle
mistralrs is the most production-ready Rust-native LLM runtime
available today: KV-cache management, paged attention, GGUF + safetensors
loading, quantisation. For Mistral / Llama / Gemma models, it's
typically the right default for a Rust-only deployment.
The infer_runtime("mistralai/...") registry returns
RuntimeKind::MistralRs so deployments with Mistral-family model names
land here automatically.
Build profiles
| Build | Result |
|---|---|
cargo build -p atomr-infer-runtime-mistralrs (default) |
Stub. |
cargo build -p atomr-infer-runtime-mistralrs --features mistralrs |
Pulls the mistralrs crate. |
Configuration
use MistralRsConfig;
let cfg = MistralRsConfig ;