llama-sampling
Sampling and decoding strategies for llama.rs.
Supports:
- Greedy (argmax)
- Temperature scaling
- Top-k filtering
- Top-p (nucleus) filtering
- Repetition penalty
- Deterministic seeded RNG for reproducible generation
Sampling and decoding strategies for llama.rs.
Supports: