pub enum CpuQuantStore {
Q4K {
weights: Vec<f32>,
n_rows: usize,
n_cols: usize,
},
}Expand description
CPU-side container for any GGUF k-quant flavour. Each variant holds
the dense fp32 weights post-eager-dequant — CPU isn’t the bench
target so we don’t pay the complexity of on-the-fly dequant here;
the variant tag exists so gemm_quant can route consistently.
New k-quant types (Q5_K / Q6_K / Q8_0) become new variants — no
trait churn, just a new arm in load_quant and gemm_quant.
Variants§
Auto Trait Implementations§
impl Freeze for CpuQuantStore
impl RefUnwindSafe for CpuQuantStore
impl Send for CpuQuantStore
impl Sync for CpuQuantStore
impl Unpin for CpuQuantStore
impl UnsafeUnpin for CpuQuantStore
impl UnwindSafe for CpuQuantStore
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more