Expand description
Inference optimization utilities
This module provides optimizations for model inference including:
- Operator fusion
- Constant folding
- Memory optimization
- Batch inference
Structs§
- Batch
Inference - Batch inference helper
- FusedOp
- Fused operation
- Inference
Config - Inference mode configuration
- Inference
Optimizer - Inference optimizer
- Inference
Session - Inference session for optimized model execution
Functions§
- warmup_
model - Warmup helper for inference