Skip to main content

MockKvCacheHandle

Struct MockKvCacheHandle 

Source
pub struct MockKvCacheHandle { /* private fields */ }
Expand description

Mock KV cache handle — tracks block metadata without allocating real memory.

Implementations§

Source§

impl MockKvCacheHandle

Source

pub fn new(request_id: RequestId, num_layers: usize, seq_len: usize) -> Self

Trait Implementations§

Source§

impl Debug for MockKvCacheHandle

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl KvCacheHandle for MockKvCacheHandle

Source§

fn block_table(&self) -> &BlockTable

Get block table for this cache
Source§

fn block_table_mut(&mut self) -> &mut BlockTable

Get mutable block table (for extending)
Source§

fn as_any(&self) -> &dyn Any

Downcast support for backend-specific handles
Source§

fn device(&self) -> Device

Get device where cache resides
Source§

fn num_layers(&self) -> usize

Get number of layers cached
Source§

fn num_heads(&self) -> usize

Get number of attention heads
Source§

fn head_dim(&self) -> usize

Get head dimension
Source§

fn key_cache(&self, _layer: usize) -> Result<Option<TensorRef>>

Get key cache for specific layer (returns tensor reference)
Source§

fn value_cache(&self, _layer: usize) -> Result<Option<TensorRef>>

Get value cache for specific layer
Source§

fn clone_handle(&self) -> Result<Arc<dyn KvCacheHandle>>

Clone handle (creates new reference, not deep copy)
Source§

fn stats(&self) -> CacheHandleStats

Get cache statistics
Source§

fn is_valid(&self) -> bool

Check if cache is valid and accessible
Source§

fn cache_id(&self) -> String

Get unique identifier for this cache instance
Source§

fn num_tokens(&self) -> usize

Get number of tokens stored in cache
Source§

fn kv_cache( &self, layer: usize, ) -> Result<(Option<Arc<dyn TensorLike>>, Option<Arc<dyn TensorLike>>), FerrumError>

Get both key and value caches for layer

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more