pub struct KvCacheConfig {
pub tail_length: usize,
pub tail_precision: Precision,
pub store_precision: Precision,
pub max_tokens: usize,
pub num_kv_heads: usize,
pub head_dim: usize,
pub migration_batch: usize,
}Expand description
KV cache configuration
Fields§
§tail_length: usizeNumber of tokens to keep in high-precision tail
tail_precision: PrecisionPrecision for tail storage
store_precision: PrecisionPrecision for quantized store
max_tokens: usizeMaximum total tokens to cache
num_kv_heads: usizeNumber of KV heads
head_dim: usizeHead dimension
migration_batch: usizeMigration batch size (tokens to move at once)
Trait Implementations§
Source§impl Clone for KvCacheConfig
impl Clone for KvCacheConfig
Source§fn clone(&self) -> KvCacheConfig
fn clone(&self) -> KvCacheConfig
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for KvCacheConfig
impl Debug for KvCacheConfig
Source§impl Default for KvCacheConfig
impl Default for KvCacheConfig
Source§impl<'de> Deserialize<'de> for KvCacheConfig
impl<'de> Deserialize<'de> for KvCacheConfig
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Auto Trait Implementations§
impl Freeze for KvCacheConfig
impl RefUnwindSafe for KvCacheConfig
impl Send for KvCacheConfig
impl Sync for KvCacheConfig
impl Unpin for KvCacheConfig
impl UnsafeUnpin for KvCacheConfig
impl UnwindSafe for KvCacheConfig
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more