vil_speculative 0.2.2

VIL Speculative Decoding Proxy — draft+verify for 2-3x faster LLM generation
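
For readers new to the draft+verify idea, the following is a minimal sketch of greedy speculative decoding over toy "models". It is not the vil_speculative API: every name here (`speculative_generate`, `greedy_generate`, `toy_draft`, `toy_target`) is hypothetical, and a model is assumed to be any function from a token prefix to the next token. A cheap draft model proposes `k` tokens, the target model checks them, and the longest agreeing prefix is kept along with the target's own token at the first disagreement.

```rust
/// Plain greedy decoding with a single model, used as the reference result.
fn greedy_generate<T: Fn(&[u32]) -> u32>(target: T, prompt: &[u32], max_new: usize) -> Vec<u32> {
    let mut out = prompt.to_vec();
    for _ in 0..max_new {
        let next = target(&out[..]);
        out.push(next);
    }
    out
}

/// Greedy speculative decoding (illustrative sketch, not the crate's API).
/// Produces exactly the tokens that `greedy_generate` would with `target`.
fn speculative_generate<D, T>(draft: D, target: T, prompt: &[u32], max_new: usize, k: usize) -> Vec<u32>
where
    D: Fn(&[u32]) -> u32,
    T: Fn(&[u32]) -> u32,
{
    let mut out = prompt.to_vec();
    let goal = prompt.len() + max_new;
    while out.len() < goal {
        // Draft phase: the cheap model proposes k tokens autoregressively.
        let mut proposal = out.clone();
        for _ in 0..k {
            let t = draft(&proposal[..]);
            proposal.push(t);
        }
        // Verify phase: the target checks each proposed position in order.
        // A real system would score all k positions in one batched pass;
        // here the target is simply called once per position.
        let base = out.len();
        let mut accepted_all = true;
        for i in 0..k {
            if out.len() >= goal {
                accepted_all = false;
                break;
            }
            let expected = target(&proposal[..base + i]);
            // Always keep the target's token: it equals the draft token on
            // agreement and is the correction on the first disagreement.
            out.push(expected);
            if expected != proposal[base + i] {
                accepted_all = false;
                break;
            }
        }
        // If every drafted token was accepted, the target yields one more token.
        if accepted_all && out.len() < goal {
            let bonus = target(&out[..]);
            out.push(bonus);
        }
    }
    out
}

/// Toy target model (assumption for illustration): a fixed arithmetic rule.
fn toy_target(prefix: &[u32]) -> u32 {
    let last = *prefix.last().unwrap_or(&0);
    (last * 3 + 1) % 7
}

/// Toy draft model: agrees with the target except after token 4.
fn toy_draft(prefix: &[u32]) -> u32 {
    let last = *prefix.last().unwrap_or(&0);
    if last == 4 { 0 } else { (last * 3 + 1) % 7 }
}
```

Calling `speculative_generate(toy_draft, toy_target, &[1], 10, 4)` returns the same sequence as `greedy_generate(toy_target, &[1], 10)`. Any speedup in a real system comes from verifying several drafted tokens in one pass of the large model, which this sketch does not model.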

Part of VIL

This crate is part of VIL — a process-oriented language and framework for building zero-copy, high-performance distributed systems.

License

Licensed under either of Apache License 2.0 or MIT License.