Skip to main content

QuantizationLevel

ai_hwaccel::quantization

Enum QuantizationLevel

#[non_exhaustive]pub enum QuantizationLevel {
    None,
    Float16,
    BFloat16,
    Int8,
    Int4,
}

Expand description

Model weight quantisation levels.

§Examples

use ai_hwaccel::QuantizationLevel;

let q = QuantizationLevel::Int8;
assert_eq!(q.bits_per_param(), 8);
assert!((q.memory_reduction_factor() - 4.0).abs() < f64::EPSILON);

Variants (Non-exhaustive)§

This enum is marked as non-exhaustive

Non-exhaustive enums could have additional variants added in future. Therefore, when matching against variants of non-exhaustive enums, an extra wildcard arm must be added to account for any future variants.

None

Full precision — FP32, 32 bits per parameter.

Float16

Half precision — FP16, 16 bits per parameter.

BFloat16

Brain floating point — BF16, 16 bits per parameter.

Int8

8-bit integer quantisation.

Int4

4-bit integer quantisation (GPTQ / AWQ style).

Implementations§

impl QuantizationLevel

pub const fn bits_per_param(&self) -> u32

Number of bits used per model parameter.

pub const fn memory_reduction_factor(&self) -> f64

Memory reduction factor relative to FP32.

Trait Implementations§

impl Clone for QuantizationLevel

fn clone(&self) -> QuantizationLevel

Returns a duplicate of the value. Read more

1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

impl Copy for QuantizationLevel

impl Debug for QuantizationLevel

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

impl<'de> Deserialize<'de> for QuantizationLevel

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more

impl Display for QuantizationLevel

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

impl Eq for QuantizationLevel

impl Hash for QuantizationLevel

fn hash<H: Hasher>(&self, state: &mut H)

Feeds this value into the given Hasher. Read more

1.3.0 · Source§

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher. Read more

impl PartialEq for QuantizationLevel

fn eq(&self, other: &QuantizationLevel) -> bool

Tests for self and other values to be equal, and is used by ==.

1.0.0 (const: unstable) · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl Serialize for QuantizationLevel

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

Serialize this value into the given Serde serializer. Read more

impl StructuralPartialEq for QuantizationLevel

impl TryFrom<u32> for QuantizationLevel

fn try_from(bits: u32) -> Result<Self, u32>

Convert from bit width to quantisation level.

32 → None (FP32)
16 → Float16
8 → Int8
4 → Int4

type Error = u32

The type returned in the event of a conversion error.

Auto Trait Implementations§

impl Freeze for QuantizationLevel

impl RefUnwindSafe for QuantizationLevel

impl Send for QuantizationLevel

impl Sync for QuantizationLevel

impl Unpin for QuantizationLevel

impl UnsafeUnpin for QuantizationLevel

impl UnwindSafe for QuantizationLevel

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest. Read more

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more

impl<T> ToString for T
where T: Display + ?Sized,

fn to_string(&self) -> String

Converts the given value to a String. Read more

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more