pub enum QuantStrategy {
None,
INT8WeightOnly,
FP16,
BF16,
INT8Dynamic,
MixedPrecision,
}Expand description
Quantization strategy
Variants§
None
No quantization (FP32)
INT8WeightOnly
Quantize weights to INT8, keep activations in FP32
FP16
Quantize weights to FP16
BF16
Quantize weights to BF16
INT8Dynamic
Quantize both weights and activations to INT8 (dynamic)
MixedPrecision
Mixed precision: sensitive layers in FP32, others in INT8
Implementations§
Source§impl QuantStrategy
impl QuantStrategy
Sourcepub fn compression_ratio(&self) -> f32
pub fn compression_ratio(&self) -> f32
Get memory compression ratio
Sourcepub fn quantizes_weights(&self) -> bool
pub fn quantizes_weights(&self) -> bool
Check if strategy quantizes weights
Sourcepub fn quantizes_activations(&self) -> bool
pub fn quantizes_activations(&self) -> bool
Check if strategy quantizes activations
Trait Implementations§
Source§impl Clone for QuantStrategy
impl Clone for QuantStrategy
Source§fn clone(&self) -> QuantStrategy
fn clone(&self) -> QuantStrategy
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for QuantStrategy
impl Debug for QuantStrategy
Source§impl PartialEq for QuantStrategy
impl PartialEq for QuantStrategy
impl Copy for QuantStrategy
impl Eq for QuantStrategy
impl StructuralPartialEq for QuantStrategy
Auto Trait Implementations§
impl Freeze for QuantStrategy
impl RefUnwindSafe for QuantStrategy
impl Send for QuantStrategy
impl Sync for QuantStrategy
impl Unpin for QuantStrategy
impl UnwindSafe for QuantStrategy
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Compare self to
key and return true if they are equal.Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more