llama-runtime 0.1.1

Execution runtime for llama.rs — oxidizedMLX integration and backend selection
Documentation
  • Coverage
  • 54.55%
    6 out of 11 items documented0 out of 6 items with examples
  • Size
  • Source code size: 11.24 kB This is the summed size of all the files inside the crates.io package for this release.
  • Documentation size: 1.92 MB This is the summed size of all files generated by rustdoc for all configured targets
  • Ø build duration
  • this release: 17s Average build duration of successful builds.
  • all releases: 19s Average build duration of successful builds in releases after 2024-10-23.
  • Links
  • stevedores-org/llama.rs
    0 0 18
  • crates.io
  • Dependencies
  • Versions
  • Owners
  • community-stevedores-org

llama-runtime

Runtime execution and verification helpers for llama.rs.

This crate includes a Phase-1 verification harness for LLAMA-006: full_forward(prompt) logits vs prefill(prompt[:-1]) + decode(last_token) logits.