llama-sampling
Sampling and decoding strategies for llama.rs. Implements greedy, temperature, top-k, top-p, and repetition penalty sampling with deterministic seeded RNG for reproducible test runs.
Sampling and decoding strategies for llama.rs. Implements greedy, temperature, top-k, top-p, and repetition penalty sampling with deterministic seeded RNG for reproducible test runs.