Expand description
§llama-sampling
Sampling and decoding strategies for llama.rs. Implements greedy, temperature, top-k, top-p, and repetition penalty sampling with deterministic seeded RNG for reproducible test runs.
Structs§
- Sampler
- Stateful sampler using deterministic RNG.
- Sampling
Config - Configuration for token sampling.
Enums§
- Sampling
Error - Error type for sampling operations.
- Sampling
Strategy - Sampling strategy.
Functions§
- apply_
repetition_ penalty - Apply repetition penalty to logits for tokens present in
history. - greedy_
sample - Greedy decoding (argmax).