llama-sampling 0.1.1

Sampling strategies for llama.rs — temperature, top-k/p, repetition penalty

llama-sampling

Sampling and decoding strategies for llama.rs.

Supports:

  • Greedy (argmax)
  • Temperature scaling
  • Top-k filtering
  • Top-p (nucleus) filtering
  • Repetition penalty
  • Deterministic seeded RNG for reproducible generation
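
The strategies listed above are usually chained into a single decoding step. The following is a minimal, std-only sketch of one way that chain might look. It is not the crate's API: `Config`, `SplitMix64`, `sample`, and the helper functions are hypothetical names chosen for illustration, the repetition penalty follows the common divide-if-positive, multiply-if-negative rule, and treating a temperature of zero or below as greedy decoding is an assumption.

```rust
// Illustrative sketch only: none of these names come from the llama-sampling crate.

/// Hypothetical sampling parameters.
struct Config { temperature: f32, top_k: usize, top_p: f32, repetition_penalty: f32 }

/// Small seeded generator (SplitMix64) so that runs are reproducible.
struct SplitMix64(u64);

impl SplitMix64 {
    /// Uniform float in [0, 1) built from the top 24 bits of the output.
    fn next_f32(&mut self) -> f32 {
        self.0 = self.0.wrapping_add(0x9E37_79B9_7F4A_7C15);
        let mut z = self.0;
        z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
        z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
        z ^= z >> 31;
        (z >> 40) as f32 / (1u32 << 24) as f32
    }
}

/// Greedy decoding: index of the largest logit (first one wins on ties).
fn argmax(logits: &[f32]) -> usize {
    let mut best = 0;
    for (i, &v) in logits.iter().enumerate() {
        if v > logits[best] { best = i; }
    }
    best
}

/// Penalize each distinct token in `history` once:
/// positive logits are divided by `penalty`, the rest are multiplied.
fn apply_repetition_penalty(logits: &mut [f32], history: &[usize], penalty: f32) {
    let mut seen = history.to_vec();
    seen.sort_unstable();
    seen.dedup();
    for t in seen {
        if let Some(l) = logits.get_mut(t) {
            *l = if *l > 0.0 { *l / penalty } else { *l * penalty };
        }
    }
}

/// Softmax of `logits / temperature` as (token, probability), most probable first.
fn softmax_sorted(logits: &[f32], temperature: f32) -> Vec<(usize, f32)> {
    let max = logits.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let mut probs: Vec<(usize, f32)> = logits
        .iter()
        .enumerate()
        .map(|(i, &l)| (i, ((l - max) / temperature).exp()))
        .collect();
    let sum: f32 = probs.iter().map(|p| p.1).sum();
    for p in probs.iter_mut() { p.1 /= sum; }
    probs.sort_by(|a, b| b.1.total_cmp(&a.1));
    probs
}

/// Keep the `top_k` most probable tokens, then the shortest prefix whose
/// cumulative probability (not renormalized after top-k) reaches `top_p`.
fn filter_top_k_top_p(mut probs: Vec<(usize, f32)>, top_k: usize, top_p: f32) -> Vec<(usize, f32)> {
    probs.truncate(top_k.max(1));
    let mut cum = 0.0;
    let mut keep = probs.len();
    for (n, p) in probs.iter().enumerate() {
        cum += p.1;
        if cum >= top_p { keep = n + 1; break; }
    }
    probs.truncate(keep);
    probs
}

/// One decoding step: penalty, then greedy or temperature + top-k + top-p sampling.
fn sample(logits: &[f32], history: &[usize], cfg: &Config, rng: &mut SplitMix64) -> usize {
    let mut logits = logits.to_vec();
    apply_repetition_penalty(&mut logits, history, cfg.repetition_penalty);
    if cfg.temperature <= 0.0 { return argmax(&logits); }
    let kept = filter_top_k_top_p(softmax_sorted(&logits, cfg.temperature), cfg.top_k, cfg.top_p);
    // Draw from the surviving tokens in proportion to their probabilities.
    let total: f32 = kept.iter().map(|p| p.1).sum();
    let mut r = rng.next_f32() * total;
    for &(tok, p) in &kept {
        if r < p { return tok; }
        r -= p;
    }
    kept.last().map(|p| p.0).unwrap_or(0)
}
```

In this sketch, a generation loop would call `sample` once per step with the model's logits and the tokens produced so far, reusing a single `SplitMix64` value. Because the generator's state is the only source of randomness, two runs that start from the same seed and see the same logits yield the same token sequence.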