pub enum KVCacheFormat {
F32,
Int8,
Fp8E4M3,
Fp8E5M2,
}Expand description
KV cache storage format
Variants§
F32
Full precision (default)
Int8
INT8 with per-head symmetric quantization: value = scale * int8_value
Fp8E4M3
FP8 E4M3 format (4 exponent, 3 mantissa bits) - good for inference
Fp8E5M2
FP8 E5M2 format (5 exponent, 2 mantissa bits) - wider range, less precision
Trait Implementations§
Source§impl Clone for KVCacheFormat
impl Clone for KVCacheFormat
Source§fn clone(&self) -> KVCacheFormat
fn clone(&self) -> KVCacheFormat
Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreimpl Copy for KVCacheFormat
Source§impl Debug for KVCacheFormat
impl Debug for KVCacheFormat
Source§impl PartialEq for KVCacheFormat
impl PartialEq for KVCacheFormat
Source§fn eq(&self, other: &KVCacheFormat) -> bool
fn eq(&self, other: &KVCacheFormat) -> bool
Tests for
self and other values to be equal, and is used by ==.impl StructuralPartialEq for KVCacheFormat
Auto Trait Implementations§
impl Freeze for KVCacheFormat
impl RefUnwindSafe for KVCacheFormat
impl Send for KVCacheFormat
impl Sync for KVCacheFormat
impl Unpin for KVCacheFormat
impl UnsafeUnpin for KVCacheFormat
impl UnwindSafe for KVCacheFormat
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
impl<A, B, T> HttpServerConnExec<A, B> for Twhere
B: Body,
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more