pub struct Quantizer { /* private fields */ }
High-level quantizer that combines configuration with optional calibration.
Implementations

impl Quantizer

pub fn new(config: QuantConfig) -> Self
Create a quantizer with the given configuration (no calibration).
pub fn with_calibration(
config: QuantConfig,
stats: HashMap<String, ActivationStats>,
) -> Self
Create a quantizer with configuration and pre-collected activation statistics.
pub fn quantize_tensor_with_name(
&self,
name: &str,
data: &[f32],
shape: Vec<usize>,
) -> Result<QuantizedTensorType>
Quantize a named tensor, applying the calibration statistics stored under name when they are available.
pub fn quantize_tensor(
&self,
data: &[f32],
shape: Vec<usize>,
) -> Result<QuantizedTensorType>
Quantize a tensor using the configured bit width and per-channel setting.
Errors
Returns QuantizeError::InvalidTensor or QuantizeError::UnsupportedConfig.
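To make the mechanics concrete, here is a minimal self-contained sketch of symmetric per-tensor quantization, the simplest scheme quantize_tensor could apply. quantize_symmetric is a hypothetical helper written for illustration only; it is not part of this crate, and the real implementation additionally supports per-channel scales and calibration-aware ranges.

```rust
/// Illustrative helper (not this crate's API): symmetric per-tensor
/// quantization of f32 data to signed integers of the given bit width.
fn quantize_symmetric(data: &[f32], bits: u32) -> (Vec<i32>, f32) {
    // Largest representable magnitude for a signed `bits`-bit integer,
    // e.g. 127 for 8 bits.
    let qmax = ((1i64 << (bits - 1)) - 1) as f32;
    // One scale for the whole tensor, chosen so max |value| maps to qmax.
    let absmax = data.iter().fold(0f32, |m, &v| m.max(v.abs()));
    let scale = if absmax == 0.0 { 1.0 } else { absmax / qmax };
    // Round-to-nearest, then clamp into the representable range.
    let q = data
        .iter()
        .map(|&v| (v / scale).round().clamp(-qmax, qmax) as i32)
        .collect();
    (q, scale)
}

fn main() {
    let (q, scale) = quantize_symmetric(&[-1.0, 0.0, 0.5, 1.0], 8);
    // -1.0 and 1.0 land on the extremes of the int8 range.
    println!("quantized = {:?}, scale = {}", q, scale);
}
```

Dequantization is then just `q as f32 * scale`, which is what makes the single scale factor sufficient metadata for a per-tensor scheme.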
pub fn quantize_model(
&self,
model: &OnnxModel,
) -> Result<Vec<QuantizedWeightOutput>>
Quantize every weight in model that passes
QuantConfig::should_quantize. Honours per-layer bit-width overrides.
When this quantizer was built with calibration, activation-based range optimization is used for the default bit-width; layers whose bit-width is overridden fall back to weight-only quantization (the calibration stats are keyed by the default configuration).
Skipped weights do not appear in the returned vector.
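Putting the constructors and quantization methods above together, a hedged usage sketch might look as follows. The method names and signatures are those documented on this page; constructing QuantConfig via Default and the collect_stats helper are assumptions, since this page does not document how either is obtained.

```rust
use std::collections::HashMap;

// Assumption: QuantConfig implements Default (not documented here).
let config = QuantConfig::default();

// Weight-only quantization, no calibration.
let quantizer = Quantizer::new(config.clone());
let q = quantizer.quantize_tensor(&[0.1, -0.4, 0.9], vec![3])?;

// With pre-collected activation statistics, keyed by tensor name.
let stats: HashMap<String, ActivationStats> = collect_stats(); // hypothetical helper
let calibrated = Quantizer::with_calibration(config, stats);
let q_named =
    calibrated.quantize_tensor_with_name("conv1.weight", &[0.1, -0.4, 0.9], vec![3])?;
```

With calibration attached, quantize_model uses activation-based range optimization for layers at the default bit-width and falls back to weight-only quantization for layers with a bit-width override, as described above.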
Trait Implementations

Auto Trait Implementations
impl Freeze for Quantizer
impl RefUnwindSafe for Quantizer
impl Send for Quantizer
impl Sync for Quantizer
impl Unpin for Quantizer
impl UnsafeUnpin for Quantizer
impl UnwindSafe for Quantizer
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.
impl<T> Downcast for T
where
    T: Any,
fn into_any(self: Box<T>) -> Box<dyn Any>

Converts Box<dyn Trait> (where Trait: Downcast) to Box<dyn Any>, which can then be further downcast into Box<ConcreteType> where ConcreteType implements Trait.
fn into_any_rc(self: Rc<T>) -> Rc<dyn Any>
Converts Rc<Trait> (where Trait: Downcast) to Rc<Any>, which can then be further downcast into Rc<ConcreteType> where ConcreteType implements Trait.
fn as_any(&self) -> &(dyn Any + 'static)
Converts &Trait (where Trait: Downcast) to &Any. This is needed since Rust cannot generate &Any's vtable from &Trait's.
fn as_any_mut(&mut self) -> &mut (dyn Any + 'static)
Converts &mut Trait (where Trait: Downcast) to &mut Any. This is needed since Rust cannot generate &mut Any's vtable from &mut Trait's.

impl<T> DowncastSync for T
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.