vil_speculative 0.1.1

VIL Speculative Decoding Proxy — draft+verify for 2-3x faster LLM generation
Documentation