Module quantization

Source
Expand description

Quantization module Quantization support for neural networks

This module provides comprehensive quantization capabilities including:

  • Post-training quantization (PTQ)
  • Quantization-aware training (QAT)
  • Mixed bit-width operations
  • Dynamic and static quantization schemes

Modules§

utils
Quantization utilities and helper functions

Structs§

DynamicQuantizer
Dynamic quantization at runtime
MixedBitWidthQuantizer
Mixed bit-width quantization support
PostTrainingQuantizer
Post-training quantization (PTQ) implementation
QuantizationAwareTraining
Quantization-aware training (QAT) support
QuantizationConfig
Quantization configuration
QuantizationParams
Quantization parameters for a tensor
QuantizedTensor
Quantized tensor representation

Enums§

QuantizationMode
Quantization mode
QuantizationScheme
Quantization scheme