Skip to main content

Module quantized_linear

Module quantized_linear 

Source
Expand description

Quantized Linear (Fully Connected) Layer

INT8 quantized linear layer with:

  • GEMM-based forward pass
  • Fused bias and requantization
  • Per-channel or per-tensor quantization

Structsยง

QuantizedLinear
Quantized Linear Layer