pub enum QuantizationPrecision {
Int1,
Int2,
Int4,
Int8,
FP16,
BF16,
Custom {
bits: u8,
},
Dynamic,
}Expand description
Quantization precision formats
Variants§
Int1
1-bit quantization (binary)
Int2
2-bit quantization
Int4
4-bit quantization
Int8
8-bit quantization
FP16
16-bit floating point
BF16
16-bit brain floating point
Custom
Custom precision
Dynamic
Dynamic precision based on value range
Trait Implementations§
Source§impl Clone for QuantizationPrecision
impl Clone for QuantizationPrecision
Source§fn clone(&self) -> QuantizationPrecision
fn clone(&self) -> QuantizationPrecision
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for QuantizationPrecision
impl Debug for QuantizationPrecision
Source§impl<'de> Deserialize<'de> for QuantizationPrecision
impl<'de> Deserialize<'de> for QuantizationPrecision
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Source§impl PartialEq for QuantizationPrecision
impl PartialEq for QuantizationPrecision
Source§impl Serialize for QuantizationPrecision
impl Serialize for QuantizationPrecision
impl Copy for QuantizationPrecision
impl Eq for QuantizationPrecision
impl StructuralPartialEq for QuantizationPrecision
Auto Trait Implementations§
impl Freeze for QuantizationPrecision
impl RefUnwindSafe for QuantizationPrecision
impl Send for QuantizationPrecision
impl Sync for QuantizationPrecision
impl Unpin for QuantizationPrecision
impl UnsafeUnpin for QuantizationPrecision
impl UnwindSafe for QuantizationPrecision
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> ConfigSerializable for Twhere
T: Serialize + for<'de> Deserialize<'de>,
impl<T> ConfigSerializable for Twhere
T: Serialize + for<'de> Deserialize<'de>,
Source§fn save_to_file(&self, path: &Path) -> Result<(), TrustformersError>
fn save_to_file(&self, path: &Path) -> Result<(), TrustformersError>
Save to file
Source§fn load_from_file(path: &Path) -> Result<Self, TrustformersError>where
Self: Sized,
fn load_from_file(path: &Path) -> Result<Self, TrustformersError>where
Self: Sized,
Load from file
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Compare self to
key and return true if they are equal.Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more