Skip to main content

SmoothQuantLinearDescriptor

Struct SmoothQuantLinearDescriptor 

Source
#[non_exhaustive]
pub struct SmoothQuantLinearDescriptor { pub m: i32, pub n: i32, pub k: i32, pub act_scale: f32, pub activation_element: ElementKind, pub weight_element: ElementKind, pub output_element: ElementKind, }
Expand description

Descriptor for a SmoothQuant linear op.

The per-tensor activation scale lives in the descriptor (not the args) because in the SmoothQuant flow it’s part of the model’s frozen quantization metadata — it doesn’t change between launches for the same layer.

Fields (Non-exhaustive)§

This struct is marked as non-exhaustive
Non-exhaustive structs could have additional fields added in future. Therefore, non-exhaustive structs cannot be constructed in external crates using the traditional Struct { .. } syntax; cannot be matched against without a wildcard ..; and struct update syntax will not work.
§m: i32

Number of token rows in the activation (and rows of the output).

§n: i32

Number of output channels (rows of weight_q, cols of output).

§k: i32

Inner reduction dim (cols of act_q and weight_q).

§act_scale: f32

Per-tensor activation scale produced by the offline SmoothQuant Python flow. Always f32 regardless of TIn — the underlying quantized_linear_w8a8 kernel does the scale multiply in float space irrespective of output dtype.

§activation_element: ElementKind

Activation int element kind. Today wired only for S8.

§weight_element: ElementKind

Weight int element kind. Today wired only for S8.

§output_element: ElementKind

Output FP element kind. Must match TIn::KIND.

Implementations§

Source§

impl SmoothQuantLinearDescriptor

Source

pub fn new<TIn: Element>(m: i32, n: i32, k: i32, act_scale: f32) -> Self

Construct a SmoothQuantLinearDescriptor for the given problem shape and per-tensor activation scale. Defaults S8 for both activation and weight; output element matches TIn::KIND.

Trait Implementations§

Source§

impl Clone for SmoothQuantLinearDescriptor

Source§

fn clone(&self) -> SmoothQuantLinearDescriptor

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Copy for SmoothQuantLinearDescriptor

Source§

impl Debug for SmoothQuantLinearDescriptor

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.