pub enum QuantizationPrecision {
    Int4,
    Int8,
    Int16,
    Float16,
    Mixed,
}
Quantization precision levels
Variants
Int4
4-bit integer (extreme compression)
Int8
8-bit integer (standard quantization)
Int16
16-bit integer
Float16
16-bit floating point (half precision)
Mixed
Mixed precision (different precisions for different layers)
Implementations
impl QuantizationPrecision
pub fn bits_per_param(&self) -> u8
Returns the number of bits used to store one parameter at this precision.
pub fn memory_reduction_ratio(&self) -> f32
Returns the memory reduction ratio relative to FP32 (32-bit floating point).
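The page lists only the signatures of these two methods, so the following is a minimal sketch of how they might behave, not the crate's actual implementation. The enum is redeclared locally so the block compiles on its own. The method bodies are assumptions: the bit widths follow the variant names, the ratio is taken to be 32 divided by the bit width, and the value used for `Mixed` is a placeholder. `estimated_bytes` is a hypothetical helper added for illustration.

```rust
/// Local stand-in for the documented enum, so the sketch compiles on its own.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum QuantizationPrecision {
    Int4,
    Int8,
    Int16,
    Float16,
    Mixed,
}

impl QuantizationPrecision {
    /// Assumed body: the bit width implied by each variant name.
    pub fn bits_per_param(&self) -> u8 {
        match self {
            Self::Int4 => 4,
            Self::Int8 => 8,
            Self::Int16 | Self::Float16 => 16,
            // Assumption: Mixed has no single width; 8 is a placeholder average.
            Self::Mixed => 8,
        }
    }

    /// Assumed body: FP32 width (32 bits) divided by this precision's width.
    pub fn memory_reduction_ratio(&self) -> f32 {
        32.0 / f32::from(self.bits_per_param())
    }
}

/// Hypothetical helper: estimated weight storage in bytes for `params` parameters.
pub fn estimated_bytes(params: u64, precision: QuantizationPrecision) -> u64 {
    // Round up so a trailing partial byte (possible with Int4) is counted.
    (params * u64::from(precision.bits_per_param()) + 7) / 8
}
```

Under these assumptions, `QuantizationPrecision::Int8.memory_reduction_ratio()` evaluates to 4.0, and `estimated_bytes(7, QuantizationPrecision::Int4)` rounds 28 bits up to 4 bytes. The real return value for `Mixed` may differ, since a mixed-precision model has no single bit width.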
Trait Implementations
impl Clone for QuantizationPrecision
fn clone(&self) -> QuantizationPrecision
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for QuantizationPrecision
impl<'de> Deserialize<'de> for QuantizationPrecision
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
impl Hash for QuantizationPrecision
impl PartialEq for QuantizationPrecision
impl Serialize for QuantizationPrecision
impl Copy for QuantizationPrecision
impl Eq for QuantizationPrecision
impl StructuralPartialEq for QuantizationPrecision
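Because the type implements `Copy`, `Eq`, and `Hash`, its values can be passed around freely and used as keys in hashed collections. A minimal sketch of that, using a local copy of the enum so the block compiles on its own; `count_by_precision` is a hypothetical helper, not part of the documented API.

```rust
use std::collections::HashMap;

/// Local stand-in for the documented enum, deriving the traits listed above
/// (the Serde traits are left out to keep the sketch std-only).
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum QuantizationPrecision {
    Int4,
    Int8,
    Int16,
    Float16,
    Mixed,
}

/// Hypothetical helper: count how many layers use each precision.
pub fn count_by_precision(
    layers: &[QuantizationPrecision],
) -> HashMap<QuantizationPrecision, usize> {
    let mut counts = HashMap::new();
    for &precision in layers {
        // `Copy` lets the value be taken out of the slice by pattern;
        // `Eq + Hash` make it a valid `HashMap` key.
        *counts.entry(precision).or_insert(0) += 1;
    }
    counts
}
```

A tally like this is one way to summarize a `Mixed` configuration, where different layers carry different precisions.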
Auto Trait Implementations
impl Freeze for QuantizationPrecision
impl RefUnwindSafe for QuantizationPrecision
impl Send for QuantizationPrecision
impl Sync for QuantizationPrecision
impl Unpin for QuantizationPrecision
impl UnsafeUnpin for QuantizationPrecision
impl UnwindSafe for QuantizationPrecision
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<Q, K> Equivalent<K> for Q
fn equivalent(&self, key: &K) -> bool
Compare self to key and return true if they are equal.
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.