pub struct FP8Config {
pub format: FP8Format,
pub scaling: ScalingStrategy,
pub delayed_scaling: DelayedScalingConfig,
pub stochastic_rounding: bool,
pub clip_to_max: bool,
pub use_hardware_ops: bool,
pub calibration_samples: usize,
}Expand description
FP8 quantization configuration
Fields§
§format: FP8FormatFP8 format to use
scaling: ScalingStrategyScaling strategy
delayed_scaling: DelayedScalingConfigDelayed scaling configuration
stochastic_rounding: boolEnable stochastic rounding for better accuracy
clip_to_max: boolClipping strategy (clip to max or saturate)
use_hardware_ops: boolUse hardware FP8 operations if available
calibration_samples: usizeCalibration samples for initial scale estimation
Trait Implementations§
Source§impl<'de> Deserialize<'de> for FP8Config
impl<'de> Deserialize<'de> for FP8Config
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Auto Trait Implementations§
impl Freeze for FP8Config
impl RefUnwindSafe for FP8Config
impl Send for FP8Config
impl Sync for FP8Config
impl Unpin for FP8Config
impl UnsafeUnpin for FP8Config
impl UnwindSafe for FP8Config
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> ConfigSerializable for Twhere
T: Serialize + for<'de> Deserialize<'de>,
impl<T> ConfigSerializable for Twhere
T: Serialize + for<'de> Deserialize<'de>,
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more