Expand description
Interface to llama.cpp’s attention KV cache for infinite context management
This module provides the bridge between our abstract KV cache management system and the actual llama.cpp attention KV cache that holds the transformer’s state.
Structs§
- LlamaKV
Cache Interface - Interface for interacting with llama.cpp’s KV cache
- LlamaKV
Cache State - Represents the actual llama.cpp KV cache state