vil_speculative
VIL Speculative Decoding Proxy — draft+verify for 2-3x faster LLM generation
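To make the draft+verify idea concrete, here is a minimal sketch of greedy speculative decoding. It is not this crate's API: `Token`, `speculative_step`, `generate`, `toy_draft`, and `toy_target` are hypothetical names invented for illustration, and both "models" are toy functions over integer tokens. A cheap draft model proposes `k` tokens, the target model checks them, and the longest agreeing prefix is kept along with one token from the target.

```rust
// Illustrative sketch only; all names here are hypothetical, not part of vil_speculative.
type Token = u32;

/// One draft+verify round. Returns the tokens accepted in this round.
/// Always yields at least one token, so generation makes progress.
fn speculative_step<D, T>(context: &[Token], k: usize, draft: D, target: T) -> Vec<Token>
where
    D: Fn(&[Token]) -> Token,
    T: Fn(&[Token]) -> Token,
{
    // Draft phase: the cheap model proposes k tokens autoregressively.
    let mut proposed = context.to_vec();
    for _ in 0..k {
        let next = draft(&proposed);
        proposed.push(next);
    }

    // Verify phase: compare each proposal with the target's greedy choice.
    // A real system would score all k positions in one batched forward pass;
    // this sketch calls the target once per position for clarity.
    let mut accepted = Vec::new();
    let mut ctx = context.to_vec();
    for i in 0..k {
        let expected = target(&ctx);
        let candidate = proposed[context.len() + i];
        if candidate == expected {
            accepted.push(candidate);
            ctx.push(candidate);
        } else {
            // First mismatch: keep the target's token and drop the rest of the draft.
            accepted.push(expected);
            return accepted;
        }
    }

    // Every draft token matched, so the target contributes one extra token.
    accepted.push(target(&ctx));
    accepted
}

/// Repeats draft+verify rounds until `max_new` tokens have been generated.
fn generate<D, T>(prompt: &[Token], max_new: usize, k: usize, draft: D, target: T) -> Vec<Token>
where
    D: Fn(&[Token]) -> Token,
    T: Fn(&[Token]) -> Token,
{
    let mut out = prompt.to_vec();
    let limit = prompt.len() + max_new;
    while out.len() < limit {
        let step = speculative_step(&out, k, &draft, &target);
        out.extend(step);
    }
    out.truncate(limit);
    out
}

/// Toy target model: counts upward modulo 10.
fn toy_target(ctx: &[Token]) -> Token {
    (ctx.last().copied().unwrap_or(0) + 1) % 10
}

/// Toy draft model: agrees with the target except right after token 4.
fn toy_draft(ctx: &[Token]) -> Token {
    let last = ctx.last().copied().unwrap_or(0);
    if last == 4 { 0 } else { (last + 1) % 10 }
}
```

With these toy models, `generate(&[0], 8, 3, toy_draft, toy_target)` produces the same sequence the target would produce on its own, because every kept token is either confirmed or supplied by the target. Any speedup in practice depends on how often the draft agrees with the target and on how cheaply the target can verify several positions at once.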
Part of VIL
This crate is part of VIL — a process-oriented language and framework for building zero-copy, high-performance distributed systems.
License
Licensed under either of Apache License 2.0 or MIT License.