Expand description
Quantized Linear (Fully Connected) Layer
INT8 quantized linear layer with:
- GEMM-based forward pass
- Fused bias and requantization
- Per-channel or per-tensor quantization
Structsยง
- Quantized
Linear - Quantized Linear Layer
Quantized Linear (Fully Connected) Layer
INT8 quantized linear layer with: