pub struct QuantizationMetadata {
    pub scheme: QuantizationScheme,
    pub group_size: u32,
    pub scale_bits: u16,
    pub zero_point: i8,
    pub has_per_group_scales: bool,
}
Quantization metadata for HCT v2.
Contains information needed to dequantize INT4/INT8 weights.
Fields
scheme: QuantizationScheme
Quantization scheme used.
group_size: u32
Group size for group-wise quantization (0 = per-tensor).
scale_bits: u16
Global scale factor (f16 stored as u16 bits).
zero_point: i8
Global zero point (for asymmetric quantization).
has_per_group_scales: bool
Whether per-group scales are stored after compressed data.
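The fields above can be combined to recover a floating-point weight. The following is a minimal sketch, not the crate's actual dequantization code: the helper names `f16_bits_to_f32`, `dequantize_int8`, and `group_index` are hypothetical, and the affine convention `real = scale * (q - zero_point)` is an assumption that the source does not confirm. It decodes `scale_bits` from IEEE 754 half-precision bits by hand so that it needs only the standard library.

```rust
/// Hypothetical helper: decode IEEE 754 half-precision bits (as stored in
/// `scale_bits`) into an f32.
fn f16_bits_to_f32(bits: u16) -> f32 {
    let sign = ((bits >> 15) & 1) as u32;
    let exp = ((bits >> 10) & 0x1f) as u32;
    let frac = (bits & 0x3ff) as u32;
    let f32_bits = match (exp, frac) {
        // Signed zero.
        (0, 0) => sign << 31,
        // Subnormal: value = frac * 2^-24.
        (0, _) => {
            let v = frac as f32 * (1.0 / 16_777_216.0);
            return if sign == 1 { -v } else { v };
        }
        // Infinity.
        (0x1f, 0) => (sign << 31) | 0x7f80_0000,
        // NaN: keep the payload bits.
        (0x1f, _) => (sign << 31) | 0x7f80_0000 | (frac << 13),
        // Normal number: rebias the exponent from 15 to 127.
        _ => (sign << 31) | ((exp + 112) << 23) | (frac << 13),
    };
    f32::from_bits(f32_bits)
}

/// Hypothetical helper: dequantize one INT8 value with the global scale and
/// zero point. ASSUMPTION: real = scale * (q - zero_point).
fn dequantize_int8(q: i8, scale_bits: u16, zero_point: i8) -> f32 {
    let scale = f16_bits_to_f32(scale_bits);
    scale * (q as i32 - zero_point as i32) as f32
}

/// Hypothetical helper: index of the group that an element belongs to.
/// A `group_size` of 0 means per-tensor quantization, so there is one group.
fn group_index(element_index: usize, group_size: u32) -> usize {
    if group_size == 0 {
        0
    } else {
        element_index / group_size as usize
    }
}
```

With `scale_bits = 0x3800` (0.5 in f16) and `zero_point = 2`, the quantized value 10 would map to 0.5 * (10 - 2) = 4.0 under the assumed convention. When `has_per_group_scales` is true, a reader would presumably use `group_index` to select the per-group scale instead of the global one; the layout of those scales is not described here.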
Trait Implementations
impl Clone for QuantizationMetadata
fn clone(&self) -> QuantizationMetadata
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for QuantizationMetadata
impl Default for QuantizationMetadata
fn default() -> QuantizationMetadata
Returns the “default value” for a type.
impl PartialEq for QuantizationMetadata
impl StructuralPartialEq for QuantizationMetadata
Auto Trait Implementations
impl Freeze for QuantizationMetadata
impl RefUnwindSafe for QuantizationMetadata
impl Send for QuantizationMetadata
impl Sync for QuantizationMetadata
impl Unpin for QuantizationMetadata
impl UnwindSafe for QuantizationMetadata
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.