pub enum ScalingStrategy {
    PerTensor,
    PerChannel,
    PerToken,
    BlockWise {
        block_size: usize,
    },
}
Scaling strategy for FP8 quantization
Variants
PerTensor
Per-tensor scaling: a single scale factor for the entire tensor.
PerChannel
Per-channel scaling: one scale factor per output channel.
PerToken
Per-token scaling: one scale factor per token (for sequence models).
BlockWise
Block-wise scaling: one scale factor per fixed-size block; the block size is set by the block_size field.
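To make the difference in granularity between the variants concrete, here is a minimal sketch of how many scale factors each strategy implies for a 2-D tensor. The helper `num_scales` and the `[tokens, channels]` shape convention are assumptions made for illustration and are not part of the crate. The sketch also assumes that block-wise scaling tiles the flattened tensor and that a trailing partial block gets its own scale factor; the real implementation may block along a single dimension instead.

```rust
// Local mirror of the enum declared above, so the sketch compiles on its own.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum ScalingStrategy {
    PerTensor,
    PerChannel,
    PerToken,
    BlockWise { block_size: usize },
}

/// Hypothetical helper (not part of the crate): number of scale factors
/// needed for a tensor of shape [tokens, channels].
pub fn num_scales(strategy: ScalingStrategy, tokens: usize, channels: usize) -> usize {
    match strategy {
        // One scale factor shared by every element.
        ScalingStrategy::PerTensor => 1,
        // One scale factor per channel.
        ScalingStrategy::PerChannel => channels,
        // One scale factor per token.
        ScalingStrategy::PerToken => tokens,
        // Assumption: blocks tile the flattened tensor; round up so a
        // trailing partial block still gets a scale factor.
        ScalingStrategy::BlockWise { block_size } => {
            assert!(block_size > 0, "block_size must be non-zero");
            (tokens * channels + block_size - 1) / block_size
        }
    }
}
```

Under these assumptions, a 4 x 256 tensor needs 1 scale factor per tensor, 256 per channel, 4 per token, and 8 with `BlockWise { block_size: 128 }`. Finer granularity stores more scale factors in exchange for tracking local value ranges more closely.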
Trait Implementations
impl Clone for ScalingStrategy
fn clone(&self) -> ScalingStrategy
Returns a duplicate of the value.
1.0.0 · fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for ScalingStrategy
impl<'de> Deserialize<'de> for ScalingStrategy
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
impl PartialEq for ScalingStrategy
impl Serialize for ScalingStrategy
impl Copy for ScalingStrategy
impl Eq for ScalingStrategy
impl StructuralPartialEq for ScalingStrategy
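As a brief illustration of what the `Copy`, `Eq`, and `Debug` implementations listed above allow, the sketch below passes a strategy by value and compares it after the copy. The enum is re-declared locally with derived traits so the snippet stands alone. The Serde implementations are left out to avoid external dependencies, and `describe` is a hypothetical helper, not part of the crate.

```rust
// Local mirror of the enum, deriving only the std traits listed above.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum ScalingStrategy {
    PerTensor,
    PerChannel,
    PerToken,
    BlockWise { block_size: usize },
}

/// Hypothetical helper: takes the strategy by value, which is cheap
/// because the type is `Copy`, and renders it with the derived `Debug`.
pub fn describe(strategy: ScalingStrategy) -> String {
    format!("{:?}", strategy)
}

/// Hypothetical helper: because the type is `Copy`, `a` is still usable
/// after being passed to `describe`, and `Eq` permits direct comparison.
pub fn same_after_copy(a: ScalingStrategy) -> bool {
    let b = a; // copies the value; `a` is not moved
    let _ = describe(a);
    a == b
}
```

Because the type is `Copy`, callers can store a strategy in a configuration struct and hand it to several functions without cloning explicitly.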
Auto Trait Implementations
impl Freeze for ScalingStrategy
impl RefUnwindSafe for ScalingStrategy
impl Send for ScalingStrategy
impl Sync for ScalingStrategy
impl Unpin for ScalingStrategy
impl UnsafeUnpin for ScalingStrategy
impl UnwindSafe for ScalingStrategy
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> ConfigSerializable for T
where
    T: Serialize + for<'de> Deserialize<'de>,
impl<Q, K> Equivalent<K> for Q
impl<Q, K> Equivalent<K> for Q
fn equivalent(&self, key: &K) -> bool
Compare self to key and return true if they are equal.
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.