pub enum Dtype {
Bf16,
F16,
F32,
Fp8E4m3,
}Expand description
Numeric dtype of cache entries. Matches the engine-side dtype 1:1 — we never convert here.
Variants§
Bf16
IEEE bfloat16 — vLLM and SGLang default for Llama-class models.
F16
IEEE binary16.
F32
IEEE binary32 (single-precision; rare in production).
Fp8E4m3
8-bit FP, E4M3 layout.
Implementations§
Trait Implementations§
Source§impl<'de> Deserialize<'de> for Dtype
impl<'de> Deserialize<'de> for Dtype
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
impl Copy for Dtype
impl Eq for Dtype
impl StructuralPartialEq for Dtype
Auto Trait Implementations§
impl Freeze for Dtype
impl RefUnwindSafe for Dtype
impl Send for Dtype
impl Sync for Dtype
impl Unpin for Dtype
impl UnsafeUnpin for Dtype
impl UnwindSafe for Dtype
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more