pub enum DecomposedQuantMethod {
Awq {
group_size: usize,
},
Gptq {
group_size: usize,
},
}Expand description
Quantization method for decomposed (non-monolithic) formats.
Unlike GGUF’s block-quantized QuantTensor, AWQ and GPTQ store weights
as separate component tensors (qweight, scales, qzeros). This enum
identifies the specific format so the correct dequantization formula
and packing layout are used.
Variants§
Awq
AWQ (Activation-aware Weight Quantization)
Packing: 8 INT4 values per u32, AWQ order shifts [0,16,4,20,8,24,12,28] Dequant: w = (q - zero) * scale
Gptq
GPTQ (Generative Pre-trained Transformer Quantization)
Packing: 8 INT4 values per u32, sequential 4-bit packing Dequant: w = q * scale + zero Additional g_idx tensor for column permutation
Implementations§
Source§impl DecomposedQuantMethod
impl DecomposedQuantMethod
Sourcepub fn group_size(&self) -> usize
pub fn group_size(&self) -> usize
Group size for this method
Trait Implementations§
Source§impl Clone for DecomposedQuantMethod
impl Clone for DecomposedQuantMethod
Source§fn clone(&self) -> DecomposedQuantMethod
fn clone(&self) -> DecomposedQuantMethod
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for DecomposedQuantMethod
impl Debug for DecomposedQuantMethod
Source§impl PartialEq for DecomposedQuantMethod
impl PartialEq for DecomposedQuantMethod
impl Copy for DecomposedQuantMethod
impl Eq for DecomposedQuantMethod
impl StructuralPartialEq for DecomposedQuantMethod
Auto Trait Implementations§
impl Freeze for DecomposedQuantMethod
impl RefUnwindSafe for DecomposedQuantMethod
impl Send for DecomposedQuantMethod
impl Sync for DecomposedQuantMethod
impl Unpin for DecomposedQuantMethod
impl UnsafeUnpin for DecomposedQuantMethod
impl UnwindSafe for DecomposedQuantMethod
Blanket Implementations§
Source§impl<T> ArchivePointee for T
impl<T> ArchivePointee for T
Source§type ArchivedMetadata = ()
type ArchivedMetadata = ()
The archived version of the pointer metadata for this type.
Source§fn pointer_metadata(
_: &<T as ArchivePointee>::ArchivedMetadata,
) -> <T as Pointee>::Metadata
fn pointer_metadata( _: &<T as ArchivePointee>::ArchivedMetadata, ) -> <T as Pointee>::Metadata
Converts some archived metadata to the pointer metadata for itself.
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Compare self to
key and return true if they are equal.Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§impl<T> LayoutRaw for T
impl<T> LayoutRaw for T
Source§fn layout_raw(_: <T as Pointee>::Metadata) -> Result<Layout, LayoutError>
fn layout_raw(_: <T as Pointee>::Metadata) -> Result<Layout, LayoutError>
Returns the layout of the type.
Source§impl<T, N1, N2> Niching<NichedOption<T, N1>> for N2
impl<T, N1, N2> Niching<NichedOption<T, N1>> for N2
Source§unsafe fn is_niched(niched: *const NichedOption<T, N1>) -> bool
unsafe fn is_niched(niched: *const NichedOption<T, N1>) -> bool
Returns whether the given value has been niched. Read more
Source§fn resolve_niched(out: Place<NichedOption<T, N1>>)
fn resolve_niched(out: Place<NichedOption<T, N1>>)
Writes data to
out indicating that a T is niched.