shimmy 1.7.0

Lightweight sub-5MB Ollama alternative with native SafeTensors support. No Python dependencies, 2x faster loading. Now with GitHub Spec-Kit integration for systematic development.
1
2
3
4
5
// Response caching for identical inference requests

pub mod response_cache;

pub use response_cache::ResponseCache;