Expand description
Fusion of consecutive quantized operations
This module provides optimized implementations for fusing multiple quantized operations, avoiding the overhead of intermediate dequantization and requantization steps in performance-critical code paths.
Functionsยง
- fused_
quantized_ matmul_ chain - Fused quantized matrix multiplication chain
- fused_
quantized_ matvec_ sequence - Fused quantized matrix-vector multiplication sequence