pub struct MultiHeadKvCache { /* private fields */ }
Multi-head KV cache for transformer attention.
Maintains one KvCacheCompressor per attention head.
Implementations§
impl MultiHeadKvCache
pub fn new(num_heads: usize, config: &KvCacheConfig) -> Result<Self>
Create a new multi-head KV cache.
§Arguments
- num_heads: number of attention heads
- config: shared configuration applied to all heads
§Errors
- Any error from KvCacheCompressor::new
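The error contract above is what one would expect if the constructor builds one compressor per head from the shared config and stops at the first failure. The following is a minimal sketch of that pattern only. `DemoConfig`, `DemoCompressor`, `DemoMultiHead`, and `DemoError` are hypothetical stand-ins invented for illustration; the real `KvCacheConfig` fields and the crate's error type are not shown on this page.

```rust
/// Hypothetical stand-in for the crate's configuration type.
#[derive(Debug, Clone)]
pub struct DemoConfig {
    pub head_dim: usize,
}

/// Hypothetical stand-in for the crate's error type.
#[derive(Debug, PartialEq)]
pub enum DemoError {
    InvalidConfig(&'static str),
}

/// Hypothetical stand-in for a per-head compressor.
#[derive(Debug, Clone)]
pub struct DemoCompressor {
    head_dim: usize,
}

impl DemoCompressor {
    /// Fails on an obviously invalid config (assumed rule, for illustration).
    pub fn new(config: &DemoConfig) -> Result<Self, DemoError> {
        if config.head_dim == 0 {
            return Err(DemoError::InvalidConfig("head_dim must be non-zero"));
        }
        Ok(Self { head_dim: config.head_dim })
    }

    pub fn head_dim(&self) -> usize {
        self.head_dim
    }
}

/// One compressor per attention head, all built from the same config.
#[derive(Debug, Clone)]
pub struct DemoMultiHead {
    heads: Vec<DemoCompressor>,
}

impl DemoMultiHead {
    pub fn new(num_heads: usize, config: &DemoConfig) -> Result<Self, DemoError> {
        // Collecting into a Result stops at the first failing head
        // and returns that head's error unchanged.
        let heads = (0..num_heads)
            .map(|_| DemoCompressor::new(config))
            .collect::<Result<Vec<_>, _>>()?;
        Ok(Self { heads })
    }

    pub fn num_heads(&self) -> usize {
        self.heads.len()
    }
}
```

Under these assumptions, `DemoMultiHead::new(4, &DemoConfig { head_dim: 8 })` yields four compressors, while a config with `head_dim: 0` surfaces the compressor's error directly to the caller.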
pub fn push_token(&mut self, keys: &[&[f32]], values: &[&[f32]]) -> Result<()>
Push a token’s KV vectors for all heads simultaneously.
keys and values must each have exactly num_heads slices of length head_dim.
§Errors
- DimensionMismatch
- NonFiniteInput
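Callers usually hold per-head vectors as `Vec<Vec<f32>>`, which does not coerce to `&[&[f32]]` directly. The sketch below shows one way to borrow them in the required shape, plus a standalone pre-check that mirrors the documented contract. `DemoInputError`, `as_slices`, and `check_token` are hypothetical helpers written for this example; in particular, which condition maps to which error variant, and the variant payloads, are assumptions rather than the crate's confirmed behavior.

```rust
/// Hypothetical error type reusing the two variant names listed above.
#[derive(Debug, PartialEq)]
pub enum DemoInputError {
    DimensionMismatch { expected: usize, got: usize },
    NonFiniteInput,
}

/// Borrow each per-head Vec as a slice so the result can be passed as `&[&[f32]]`.
pub fn as_slices(rows: &[Vec<f32>]) -> Vec<&[f32]> {
    rows.iter().map(|r| r.as_slice()).collect()
}

/// Check one side (keys or values) against the documented shape.
fn check_side(
    side: &[&[f32]],
    num_heads: usize,
    head_dim: usize,
) -> Result<(), DemoInputError> {
    // Exactly one slice per head.
    if side.len() != num_heads {
        return Err(DemoInputError::DimensionMismatch {
            expected: num_heads,
            got: side.len(),
        });
    }
    for head in side {
        // Each slice must have length head_dim.
        if head.len() != head_dim {
            return Err(DemoInputError::DimensionMismatch {
                expected: head_dim,
                got: head.len(),
            });
        }
        // Reject NaN and infinities.
        if head.iter().any(|x| !x.is_finite()) {
            return Err(DemoInputError::NonFiniteInput);
        }
    }
    Ok(())
}

/// Validate both keys and values for a single token.
pub fn check_token(
    keys: &[&[f32]],
    values: &[&[f32]],
    num_heads: usize,
    head_dim: usize,
) -> Result<(), DemoInputError> {
    check_side(keys, num_heads, head_dim)?;
    check_side(values, num_heads, head_dim)
}
```

With the real type, the borrowed vectors from `as_slices` would be passed as `cache.push_token(&k, &v)?`, which matches the signature shown above.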
Trait Implementations§
impl Clone for MultiHeadKvCache
fn clone(&self) -> MultiHeadKvCache
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
Auto Trait Implementations§
impl Freeze for MultiHeadKvCache
impl RefUnwindSafe for MultiHeadKvCache
impl Send for MultiHeadKvCache
impl Sync for MultiHeadKvCache
impl Unpin for MultiHeadKvCache
impl UnsafeUnpin for MultiHeadKvCache
impl UnwindSafe for MultiHeadKvCache
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<SS, SP> SupersetOf<SS> for SP
where
    SS: SubsetOf<SP>,
fn to_subset(&self) -> Option<SS>
The inverse inclusion map: attempts to construct self from the equivalent element of its superset.
fn is_in_subset(&self) -> bool
Checks if self is actually part of its subset T (and can be converted to it).
fn to_subset_unchecked(&self) -> SS
Use with care! Same as self.to_subset but without any property checks. Always succeeds.
fn from_subset(element: &SS) -> SP
The inclusion map: converts self to the equivalent element of its superset.