pub struct QuantizedMatmulParams {
    pub m: u32,
    pub k: u32,
    pub n: u32,
    pub group_size: u32,
    pub bits: u32,
}
Parameters describing the quantized matmul dimensions and format.
Fields
m: u32
Number of input rows (tokens).
k: u32
Inner dimension (shared between input and weight).
n: u32
Number of output columns.
group_size: u32
Number of consecutive values sharing one scale/bias pair.
bits: u32
Quantization bit width (4, 6, or 8).
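The field descriptions above can be turned into a couple of quick sanity checks. The following is a minimal sketch, not the crate's own API: the struct is mirrored locally so the example compiles on its own, and the helpers `groups_per_column` and `packed_weight_bytes` are hypothetical. It also assumes that groups run along `k`, that `k` is a multiple of `group_size`, and that weights are densely bit-packed; the actual layout may differ.

```rust
// Local mirror of the documented struct so this sketch compiles standalone.
// In real code, import QuantizedMatmulParams from its crate instead.
#[derive(Clone, Copy, Debug)]
pub struct QuantizedMatmulParams {
    pub m: u32,
    pub k: u32,
    pub n: u32,
    pub group_size: u32,
    pub bits: u32,
}

/// Hypothetical helper: number of scale/bias groups per output column.
/// Assumes groups run along `k` and that `k` is a multiple of `group_size`.
fn groups_per_column(p: &QuantizedMatmulParams) -> Option<u32> {
    if p.group_size == 0 || p.k % p.group_size != 0 {
        return None;
    }
    Some(p.k / p.group_size)
}

/// Hypothetical helper: bytes needed for an n x k weight matrix,
/// assuming dense bit packing with no padding (an assumption, not documented).
fn packed_weight_bytes(p: &QuantizedMatmulParams) -> Option<u64> {
    // The documentation lists 4, 6, and 8 as the supported bit widths.
    if !matches!(p.bits, 4 | 6 | 8) {
        return None;
    }
    let total_bits = p.n as u64 * p.k as u64 * p.bits as u64;
    if total_bits % 8 != 0 {
        return None;
    }
    Some(total_bits / 8)
}

fn main() {
    let p = QuantizedMatmulParams { m: 8, k: 128, n: 256, group_size: 32, bits: 4 };
    println!("{:?}", p);
    println!("groups per column: {:?}", groups_per_column(&p));
    println!("packed weight bytes: {:?}", packed_weight_bytes(&p));
}
```

Returning `Option` rather than panicking keeps the checks usable as a validation step before dispatching a kernel.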
Trait Implementations
impl Clone for QuantizedMatmulParams
fn clone(&self) -> QuantizedMatmulParams
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self) (since 1.0.0)
Performs copy-assignment from source.
impl Debug for QuantizedMatmulParams
impl Copy for QuantizedMatmulParams
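Because the type implements `Copy`, `Clone`, and `Debug`, it can be passed by value, duplicated with a plain assignment, and printed with `{:?}`. The sketch below illustrates this with a locally mirrored struct that derives all three traits (the real type may implement them differently) and a hypothetical consumer function `output_elements`, which assumes the output is an `m` by `n` matrix.

```rust
// Local mirror of the documented struct; the derives stand in for the
// Clone, Copy, and Debug implementations listed in the documentation.
#[derive(Clone, Copy, Debug)]
pub struct QuantizedMatmulParams {
    pub m: u32,
    pub k: u32,
    pub n: u32,
    pub group_size: u32,
    pub bits: u32,
}

/// Hypothetical consumer that takes the parameters by value.
/// Assumes the output is an m x n matrix.
fn output_elements(p: QuantizedMatmulParams) -> u64 {
    p.m as u64 * p.n as u64
}

fn main() {
    let base = QuantizedMatmulParams { m: 8, k: 128, n: 256, group_size: 32, bits: 4 };

    // Assignment copies the value; `base` remains valid afterwards.
    let mut variant = base;
    variant.bits = 8;

    // Passing by value does not move `base`, because the type is Copy.
    let total = output_elements(base);

    println!("{:?}", base);
    println!("{:?}", variant);
    println!("output elements: {}", total);
}
```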
Auto Trait Implementations
impl Freeze for QuantizedMatmulParams
impl RefUnwindSafe for QuantizedMatmulParams
impl Send for QuantizedMatmulParams
impl Sync for QuantizedMatmulParams
impl Unpin for QuantizedMatmulParams
impl UnsafeUnpin for QuantizedMatmulParams
impl UnwindSafe for QuantizedMatmulParams
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.