pub enum QuantizationScheme {
    Int8,
    Int4,
    Dynamic,
    DynamicINT8,
    GPTQ,
    AWQ,
    BnB8bit,
    BnB4bit,
    BnB4bitFP4,
}
Quantization schemes supported by TrustformeRS
Variants
Int8
8-bit integer quantization
Int4
4-bit integer quantization (weight-only)
Dynamic
Dynamic quantization (runtime quantization)
DynamicINT8
Dynamic 8-bit integer quantization (runtime quantization)
GPTQ
GPTQ (post-training quantization for generative pre-trained transformers)
AWQ
AWQ (Activation-aware Weight Quantization)
BnB8bit
BitsAndBytes 8-bit quantization
BnB4bit
BitsAndBytes 4-bit NormalFloat quantization
BnB4bitFP4
BitsAndBytes 4-bit floating-point (FP4) quantization
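As a rough illustration of how the variants above might be consumed, the sketch below matches on the enum to recover the nominal bit width implied by each description. The enum is a local mirror so the snippet compiles without the crate. `nominal_bits` and `is_runtime` are hypothetical helpers for this example, not part of TrustformeRS, and the `None` arm reflects only that the descriptions above state no width for those variants.

```rust
// Local mirror of the documented enum (assumption: same variant names),
// so this sketch compiles on its own.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum QuantizationScheme {
    Int8,
    Int4,
    Dynamic,
    DynamicINT8,
    GPTQ,
    AWQ,
    BnB8bit,
    BnB4bit,
    BnB4bitFP4,
}

/// Hypothetical helper: nominal width in bits implied by the variant
/// description, or `None` where the description does not fix a width.
pub fn nominal_bits(scheme: QuantizationScheme) -> Option<u8> {
    use QuantizationScheme::*;
    match scheme {
        Int8 | DynamicINT8 | BnB8bit => Some(8),
        Int4 | BnB4bit | BnB4bitFP4 => Some(4),
        // The variant descriptions state no bit width for these.
        Dynamic | GPTQ | AWQ => None,
    }
}

/// Hypothetical helper: true for the variants described as runtime quantization.
pub fn is_runtime(scheme: QuantizationScheme) -> bool {
    matches!(
        scheme,
        QuantizationScheme::Dynamic | QuantizationScheme::DynamicINT8
    )
}
```

Because the `match` has no wildcard arm, adding a variant to the enum makes the helper fail to compile until the new case is handled.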
Trait Implementations
impl Clone for QuantizationScheme
fn clone(&self) -> QuantizationScheme
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self) (since 1.0.0)
Performs copy-assignment from source.
impl Debug for QuantizationScheme
impl<'de> Deserialize<'de> for QuantizationScheme
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
impl PartialEq for QuantizationScheme
impl Serialize for QuantizationScheme
impl Copy for QuantizationScheme
impl Eq for QuantizationScheme
impl StructuralPartialEq for QuantizationScheme
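The standard-library traits listed above (`Clone`, `Copy`, `Debug`, `PartialEq`, `Eq`) can be exercised as in the sketch below. It uses a local mirror of the enum with the same derives, and `describe` and `copy_and_compare` are hypothetical names for this example. `Serialize` and `Deserialize` are left out because they need the external serde crate.

```rust
// Local mirror with the std derives listed in this section (assumption:
// the real enum derives the same traits, as the page indicates).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum QuantizationScheme {
    Int8,
    Int4,
    Dynamic,
    DynamicINT8,
    GPTQ,
    AWQ,
    BnB8bit,
    BnB4bit,
    BnB4bitFP4,
}

/// Hypothetical helper: formats a scheme through its `Debug` impl.
/// The argument is taken by value; `Copy` keeps the caller's value usable.
pub fn describe(scheme: QuantizationScheme) -> String {
    format!("{:?}", scheme)
}

/// Hypothetical helper showing `Copy`, `Clone`, and `PartialEq` together.
pub fn copy_and_compare() -> bool {
    let a = QuantizationScheme::BnB4bit;
    let b = a; // Copy: `a` is still valid after this line
    let c = a.clone(); // Clone yields an equal value
    a == b && b == c && a != QuantizationScheme::BnB8bit
}
```

With a derived `Debug`, `describe` yields the bare variant name, which can be convenient in log messages.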
Auto Trait Implementations
impl Freeze for QuantizationScheme
impl RefUnwindSafe for QuantizationScheme
impl Send for QuantizationScheme
impl Sync for QuantizationScheme
impl Unpin for QuantizationScheme
impl UnsafeUnpin for QuantizationScheme
impl UnwindSafe for QuantizationScheme
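In practice, the `Send` and `Sync` auto traits mean a scheme value can be moved into or shared with another thread. The sketch below checks this at compile time and passes a value through a worker thread. It uses a local mirror of the enum, and `assert_send_sync` and `round_trip_through_thread` are hypothetical names for this example.

```rust
use std::thread;

// Local mirror of the documented enum (assumption: same variant names).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum QuantizationScheme {
    Int8,
    Int4,
    Dynamic,
    DynamicINT8,
    GPTQ,
    AWQ,
    BnB8bit,
    BnB4bit,
    BnB4bitFP4,
}

/// Compiles only if `T` is both `Send` and `Sync`.
fn assert_send_sync<T: Send + Sync>() {}

/// Hypothetical helper: moves a scheme into a worker thread and returns it.
pub fn round_trip_through_thread(scheme: QuantizationScheme) -> QuantizationScheme {
    assert_send_sync::<QuantizationScheme>();
    // `move` transfers the value into the closure; `join` hands it back.
    thread::spawn(move || scheme)
        .join()
        .expect("worker thread panicked")
}
```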
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> ConfigSerializable for T
where
    T: Serialize + for<'de> Deserialize<'de>,
impl<Q, K> Equivalent<K> for Q
fn equivalent(&self, key: &K) -> bool
Compare self to key and return true if they are equal.
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.