abyo-speculate 0.5.0

Pure Rust Speculative Decoding library for local LLMs — vanilla SD + Medusa, Qwen2 + Llama, batch-1 optimised
Documentation
1
2
3
4
5
//! Sampling utilities: softmax, top-p, temperature, rejection sampling.

pub mod tokens;

pub use tokens::{sample_from_distribution, softmax_with_temperature, top_p_filter};