oxicuda-quant 0.1.2

GPU-accelerated quantization and model compression engine for OxiCUDA
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
14
//! # Quantization-Aware Training (QAT)
//!
//! This module provides the building blocks for QAT:
//!
//! | Module        | Contents                                         |
//! |---------------|--------------------------------------------------|
//! | `observer`    | MinMax, MovingAvg, Histogram calibration        |
//! | `fake_quant`  | FakeQuantize with Straight-Through Estimator     |

pub mod fake_quant;
pub mod observer;

pub use fake_quant::FakeQuantize;
pub use observer::{HistogramObserver, MinMaxObserver, MovingAvgObserver, Observer};