pub enum KVCacheType {
F32,
TurboQuantMSE {
bits: u8,
},
TurboQuantProd {
bits: u8,
},
}Expand description
Which KV cache implementation to use.
Variants§
F32
Standard f32 KV cache (default).
TurboQuantMSE
TurboQuant MSE: Hadamard rotation + scalar quantization (biased, lower overhead).
TurboQuantProd
TurboQuant prod: MSE + QJL correction (unbiased, higher accuracy).
Implementations§
Source§impl KVCacheType
impl KVCacheType
Sourcepub fn to_tq_config(&self, dim: usize) -> Option<TurboQuantConfig>
pub fn to_tq_config(&self, dim: usize) -> Option<TurboQuantConfig>
Convert to a TurboQuantConfig if this is a TurboQuant variant.
Sourcepub fn is_turboquant(&self) -> bool
pub fn is_turboquant(&self) -> bool
Whether this is any TurboQuant variant.
Trait Implementations§
Source§impl Clone for KVCacheType
impl Clone for KVCacheType
Source§fn clone(&self) -> KVCacheType
fn clone(&self) -> KVCacheType
Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreimpl Copy for KVCacheType
Source§impl Debug for KVCacheType
impl Debug for KVCacheType
Source§impl Default for KVCacheType
impl Default for KVCacheType
Source§impl<'de> Deserialize<'de> for KVCacheType
impl<'de> Deserialize<'de> for KVCacheType
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
impl Eq for KVCacheType
Source§impl PartialEq for KVCacheType
impl PartialEq for KVCacheType
Source§fn eq(&self, other: &KVCacheType) -> bool
fn eq(&self, other: &KVCacheType) -> bool
Tests for
self and other values to be equal, and is used by ==.Source§impl Serialize for KVCacheType
impl Serialize for KVCacheType
impl StructuralPartialEq for KVCacheType
Auto Trait Implementations§
impl Freeze for KVCacheType
impl RefUnwindSafe for KVCacheType
impl Send for KVCacheType
impl Sync for KVCacheType
impl Unpin for KVCacheType
impl UnsafeUnpin for KVCacheType
impl UnwindSafe for KVCacheType
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> DeserializeOwned for Twhere
T: for<'de> Deserialize<'de>,
impl<A, B, T> HttpServerConnExec<A, B> for Twhere
B: Body,
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more