Expand description
§llama-runtime
Runtime execution and verification helpers for llama.rs.
This crate includes:
- A
MockEnginedemonstrating the narrow-waistLlamaEnginetrait - A Phase-1 verification harness for LLAMA-006:
full_forward(prompt)logits vsprefill(prompt[:-1]) + decode(last_token)logits.
Structs§
- KvEquivalence
Report - Result of a KV equivalence run.
- Mock
Engine - A mock engine implementation for Milestone A.
- Runtime
Verifier - Minimal deterministic runtime verifier used for LLAMA-006 true tests.
Enums§
- Runtime
Error - Errors for runtime verification routines.