pub enum PrecisionMode {
FP32,
FP16,
BF16,
Mixed {
compute: ComputePrecision,
accumulate_fp32: bool,
},
}Expand description
Precision mode for inference
Variants§
FP32
Full precision (FP32)
FP16
Half precision (FP16) - good for NVIDIA GPUs
BF16
Brain float 16 (BF16) - good for modern accelerators
Mixed
Mixed precision - compute in FP16/BF16 but accumulate in FP32
Fields
§
compute: ComputePrecisionCompute precision
Implementations§
Source§impl PrecisionMode
impl PrecisionMode
Sourcepub fn is_reduced_precision(&self) -> bool
pub fn is_reduced_precision(&self) -> bool
Check if this mode uses reduced precision
Sourcepub fn memory_reduction_factor(&self) -> f32
pub fn memory_reduction_factor(&self) -> f32
Get the memory reduction factor compared to FP32
Trait Implementations§
Source§impl Clone for PrecisionMode
impl Clone for PrecisionMode
Source§fn clone(&self) -> PrecisionMode
fn clone(&self) -> PrecisionMode
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for PrecisionMode
impl Debug for PrecisionMode
Source§impl Default for PrecisionMode
impl Default for PrecisionMode
Source§fn default() -> PrecisionMode
fn default() -> PrecisionMode
Returns the “default value” for a type. Read more
Source§impl<'de> Deserialize<'de> for PrecisionMode
impl<'de> Deserialize<'de> for PrecisionMode
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Source§impl PartialEq for PrecisionMode
impl PartialEq for PrecisionMode
Source§impl Serialize for PrecisionMode
impl Serialize for PrecisionMode
impl Copy for PrecisionMode
impl Eq for PrecisionMode
impl StructuralPartialEq for PrecisionMode
Auto Trait Implementations§
impl Freeze for PrecisionMode
impl RefUnwindSafe for PrecisionMode
impl Send for PrecisionMode
impl Sync for PrecisionMode
impl Unpin for PrecisionMode
impl UnwindSafe for PrecisionMode
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
Source§fn equivalent(&self, key: &K) -> bool
fn equivalent(&self, key: &K) -> bool
Compare self to
key and return true if they are equal.Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more