QuantizationMethod

Enum QuantizationMethod 

Source
pub enum QuantizationMethod {
    Uniform,
    Symmetric,
    Affine,
    PowerOfTwo,
    Int4,
    UInt4,
    Float16,
    BFloat16,
    PerChannelSymmetric,
    PerChannelAffine,
}
Expand description

Supported methods of quantization

Variants§

§

Uniform

Uniform quantization maps the input range to uniform discrete levels with equal spacing between consecutive levels

§

Symmetric

Symmetric quantization is centered around zero and has equal positive and negative ranges, making it suitable for weight matrices

§

Affine

Affine quantization uses the formula q = scale * (x - zero_point) allowing better representation of asymmetric distributions

§

PowerOfTwo

Power-of-two quantization uses powers of 2 for the scale factor, enabling efficient implementation with bitshifts

§

Int4

Int4 quantization uses 4-bit signed integers, packing two values into each byte for memory efficiency. This is useful for model compression in ML applications.

§

UInt4

UInt4 quantization uses 4-bit unsigned integers, packing two values into each byte. This provides a positive-only range with maximum memory efficiency.

§

Float16

Float16 quantization uses IEEE 754 16-bit half-precision floating point format. It provides a good balance between precision and memory efficiency for ML models.

§

BFloat16

BFloat16 quantization uses the “brain floating point” 16-bit format, which has the same exponent size as f32 but fewer mantissa bits. This is especially well-suited for deep learning applications.

§

PerChannelSymmetric

Per-channel symmetric quantization applies different symmetric quantization parameters to each channel (column), improving accuracy for matrices with varying distributions across channels.

§

PerChannelAffine

Per-channel affine quantization applies different affine quantization parameters to each channel (column), allowing for better representation of asymmetric distributions that vary by channel.

Trait Implementations§

Source§

impl Clone for QuantizationMethod

Source§

fn clone(&self) -> QuantizationMethod

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for QuantizationMethod

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl PartialEq for QuantizationMethod

Source§

fn eq(&self, other: &QuantizationMethod) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl Copy for QuantizationMethod

Source§

impl Eq for QuantizationMethod

Source§

impl StructuralPartialEq for QuantizationMethod

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V