Expand description
Safe wrapper around llama_context.
Submodules:
tensor_capture— hookcb_evalduringLlamaContext::decodeto copy intermediate tensors (per-layer hidden states, norms, …).memory_breakdown— per-buffer memory usage after load/decode.kv_cache— sequence copy, shift, and clear helpers.
Re-exports§
pub use memory_breakdown::MemoryBreakdownEntry;pub use tensor_capture::CapturedTensor;pub use tensor_capture::TensorCapture;
Modules§
- kv_
cache - utilities for working with the kv cache
- memory_
breakdown - Per-buffer-type memory usage reported by llama.cpp.
- params
- A safe wrapper around
llama_context_params. - perf
- Safe wrapper around
llama_perf_context_data. - session
- utilities for working with session files
- tensor_
capture - Capture intermediate tensor outputs during
crate::LlamaContext::decode.
Structs§
- Llama
Context - A safe wrapper around the
llama_contextC++ context.