Module llama_cache_interface


Interface to llama.cpp’s attention KV cache for infinite context management

This module provides the bridge between our abstract KV cache management system and the actual llama.cpp attention KV cache that holds the transformer’s state.
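The bridge described above can be pictured with a minimal sketch. Everything in it is an assumption for illustration: `SketchState`, `SketchBackend`, and `push_bounded` are hypothetical names, the oldest-first eviction policy is a guess at what "infinite context management" might involve, and nothing here reflects the real fields or methods of `LlamaKVCacheInterface` or `LlamaKVCacheState`. The in-memory backend stands in for llama.cpp, so no FFI calls are made.

```rust
use std::collections::VecDeque;

/// Hypothetical stand-in for the backend cache state (not the real
/// `LlamaKVCacheState`, whose fields this page does not show).
#[derive(Debug, Default)]
pub struct SketchState {
    /// Token positions currently held, oldest first.
    pub positions: VecDeque<u32>,
}

/// Hypothetical bridge: the abstract manager talks only to this trait.
/// A real implementation would forward each call to llama.cpp.
pub trait SketchBackend {
    /// Number of positions currently stored.
    fn len(&self) -> usize;
    /// Appends one position at the newest end.
    fn push(&mut self, pos: u32);
    /// Drops up to `n` of the oldest positions; returns how many were dropped.
    fn evict_oldest(&mut self, n: usize) -> usize;
}

impl SketchBackend for SketchState {
    fn len(&self) -> usize {
        self.positions.len()
    }

    fn push(&mut self, pos: u32) {
        self.positions.push_back(pos);
    }

    fn evict_oldest(&mut self, n: usize) -> usize {
        let k = n.min(self.positions.len());
        // Draining the front removes the oldest entries first.
        self.positions.drain(..k).count()
    }
}

/// Manager-side policy (an assumption): keep at most `capacity` positions
/// (with `capacity >= 1`) by evicting the oldest ones before appending.
/// Returns the number of positions evicted.
pub fn push_bounded<B: SketchBackend>(backend: &mut B, capacity: usize, pos: u32) -> usize {
    let evicted = if backend.len() >= capacity {
        backend.evict_oldest(backend.len() + 1 - capacity)
    } else {
        0
    };
    backend.push(pos);
    evicted
}
```

Keeping the policy in a free function that is generic over the trait means the management logic can be tested against an in-memory backend like this one, while a llama.cpp-backed implementation is substituted at runtime.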

Structs§

LlamaKVCacheInterface
Interface for interacting with llama.cpp’s KV cache
LlamaKVCacheState
Represents the actual llama.cpp KV cache state