Expand description
Graph Rewrite Passes for INT8 Quantization (ADR-091 Phase 3)
This module implements four critical graph optimization passes:
- GR-1: fuse_batchnorm_to_conv - Absorb BatchNorm into Conv weights/bias
- GR-2: fuse_zp_to_bias - Pre-compute zero-point correction in bias
- GR-3: insert_qdq_nodes - Insert Quantize/Dequantize nodes at boundaries
- GR-4: fuse_relu/fuse_hardswish - Merge activations into preceding ops
Structs§
- Computation
Graph - Computation graph for optimization passes
- Graph
Node - Graph node representing a single operation
Enums§
- Node
Params - Parameters for different node types
- Node
Type - Computation graph node types
Functions§
- fuse_
batchnorm_ to_ conv - GR-1: Fuse BatchNorm parameters into Conv weights and bias
- fuse_
hardswish - GR-4: Fuse HardSwish activation using LUT
- fuse_
relu - GR-4: Fuse ReLU activation into preceding convolution
- fuse_
zp_ to_ bias - GR-2: Fuse zero-point correction into bias
- generate_
hardswish_ lut - Generate HardSwish LUT for INT8 quantized values
- insert_
qdq_ nodes - GR-3: Insert Quantize/Dequantize nodes at INT8 subgraph boundaries