llama-runtime
Runtime execution and verification helpers for llama.rs.
This crate includes:
- A
MockEnginedemonstrating the narrow-waistLlamaEnginetrait - A Phase-1 verification harness for LLAMA-006:
full_forward(prompt)logits vsprefill(prompt[:-1]) + decode(last_token)logits.