QuantScheme

cubecl_common::quant::scheme

Struct QuantScheme

pub struct QuantScheme {
    pub value: QuantValue,
    pub param: QuantParam,
    pub store: QuantStore,
    pub level: QuantLevel,
    pub mode: QuantMode,
}

Expand description

Describes a quantization scheme/configuration.

Fields§

§value: QuantValue

The logical data type of quantized input values (e.g., QInt8).

This defines how values are interpreted during computation, independent of how they’re stored.

§param: QuantParam

Precision used for quantization parameters (e.g., scale and biases).

§store: QuantStore

Data type used for storing quantized values.

§level: QuantLevel

Granularity level of quantization (e.g., per-tensor).

§mode: QuantMode

Quantization mode (e.g., symmetric).

Implementations§

impl QuantScheme

pub fn with_level(self, level: QuantLevel) -> Self

Set the quantization level.

pub fn with_mode(self, mode: QuantMode) -> Self

Set the quantization mode.

pub fn with_value(self, value: QuantValue) -> Self

Set the data type used for quantized values.

pub fn with_store(self, store: QuantStore) -> Self

Set the data type used to store quantized values.

pub fn with_param(self, param: QuantParam) -> Self

Set the precision used for quantization parameters

pub fn size_bits_stored(&self) -> usize

Returns the size of the quantization storage type in bits.

pub fn size_bits_value(&self) -> usize

Returns the size of the quantization storage type in bits.

pub fn num_quants(&self) -> usize

Returns the number of quantized values stored in a single element.

pub fn native_packing(&self) -> usize

Returns the native packing factor for the values. When native packing > 1, the packed representation stores num_quants elements grouped into packs of native_packing size.

Trait Implementations§

impl Clone for QuantScheme

fn clone(&self) -> QuantScheme

Returns a duplicate of the value. Read more

1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

impl Debug for QuantScheme

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

impl Default for QuantScheme

fn default() -> Self

Returns the “default value” for a type. Read more

impl<'de> Deserialize<'de> for QuantScheme

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more

impl Hash for QuantScheme

fn hash<H: Hasher>(&self, state: &mut H)

Feeds this value into the given Hasher. Read more

1.3.0 · Source§

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher. Read more

impl Ord for QuantScheme

fn cmp(&self, other: &QuantScheme) -> Ordering

This method returns an Ordering between self and other. Read more

1.21.0 · Source§

fn max(self, other: Self) -> Self
where Self: Sized,

Compares and returns the maximum of two values. Read more

1.21.0 · Source§

fn min(self, other: Self) -> Self
where Self: Sized,

Compares and returns the minimum of two values. Read more

1.50.0 · Source§

fn clamp(self, min: Self, max: Self) -> Self
where Self: Sized,

Restrict a value to a certain interval. Read more

impl PartialEq for QuantScheme

fn eq(&self, other: &QuantScheme) -> bool

Tests for self and other values to be equal, and is used by ==.

1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl PartialOrd for QuantScheme

fn partial_cmp(&self, other: &QuantScheme) -> Option<Ordering>

This method returns an ordering between self and other values if one exists. Read more

1.0.0 · Source§

fn lt(&self, other: &Rhs) -> bool

Tests less than (for self and other) and is used by the < operator. Read more

1.0.0 · Source§

fn le(&self, other: &Rhs) -> bool

Tests less than or equal to (for self and other) and is used by the <= operator. Read more

1.0.0 · Source§

fn gt(&self, other: &Rhs) -> bool

Tests greater than (for self and other) and is used by the > operator. Read more

1.0.0 · Source§

fn ge(&self, other: &Rhs) -> bool

Tests greater than or equal to (for self and other) and is used by the >= operator. Read more

impl Serialize for QuantScheme

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

Serialize this value into the given Serde serializer. Read more

impl Copy for QuantScheme

impl Eq for QuantScheme

impl StructuralPartialEq for QuantScheme

Auto Trait Implementations§

impl Freeze for QuantScheme

impl RefUnwindSafe for QuantScheme

impl Send for QuantScheme

impl Sync for QuantScheme

impl Unpin for QuantScheme

impl UnwindSafe for QuantScheme

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest. Read more

impl<Q, K> Comparable<K> for Q
where Q: Ord + ?Sized, K: Borrow<Q> + ?Sized,

fn compare(&self, key: &K) -> Ordering

Compare self to key and return their ordering.

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

Compare self to key and return true if they are equal.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,