Module batch

Batch processing for high-throughput inference

Provides static and dynamic batching to maximize GPU/CPU utilization and achieve 3-10x throughput improvements compared with processing requests one at a time.
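
To make the difference between static and dynamic batching concrete, here is a minimal sketch. It does not use this module's API: `SketchStrategy`, `SketchRequest`, `SketchBatcher`, and all of their fields and methods are hypothetical names invented for illustration, and the flush rules (fixed size for static; size limit or wait timeout for dynamic) are a common interpretation of the two terms rather than a description of how `BatchProcessor` is implemented.

```rust
use std::time::{Duration, Instant};

/// Hypothetical strategy type for this sketch; not the module's `BatchStrategy`.
#[derive(Debug, Clone, Copy)]
enum SketchStrategy {
    /// Emit a batch of exactly `size` requests once that many are queued.
    Static { size: usize },
    /// Emit a batch when `max_size` requests are queued, or when the oldest
    /// queued request has waited at least `max_wait`.
    Dynamic { max_size: usize, max_wait: Duration },
}

/// Hypothetical request: an id plus its arrival time.
#[derive(Debug, Clone, PartialEq)]
struct SketchRequest {
    id: u64,
    arrived: Instant,
}

/// Hypothetical batcher that queues requests and decides when to flush.
struct SketchBatcher {
    strategy: SketchStrategy,
    queue: Vec<SketchRequest>,
}

impl SketchBatcher {
    fn new(strategy: SketchStrategy) -> Self {
        Self { strategy, queue: Vec::new() }
    }

    fn push(&mut self, req: SketchRequest) {
        self.queue.push(req);
    }

    /// Returns a batch if the strategy considers one ready at time `now`.
    fn poll(&mut self, now: Instant) -> Option<Vec<SketchRequest>> {
        // Number of requests to take from the front of the queue (0 = not ready).
        let take = match self.strategy {
            SketchStrategy::Static { size } => {
                if self.queue.len() >= size { size } else { 0 }
            }
            SketchStrategy::Dynamic { max_size, max_wait } => {
                // The queue is in arrival order, so the first entry is the oldest.
                let timed_out = self
                    .queue
                    .first()
                    .map_or(false, |r| now.saturating_duration_since(r.arrived) >= max_wait);
                if self.queue.len() >= max_size {
                    max_size
                } else if timed_out {
                    self.queue.len()
                } else {
                    0
                }
            }
        };
        if take == 0 {
            None
        } else {
            Some(self.queue.drain(..take).collect())
        }
    }
}
```

In this sketch a static batcher holds a partial batch indefinitely, which favors throughput under steady load, while the dynamic batcher bounds how long any request waits by flushing a partial batch once `max_wait` has elapsed. Passing `now` into `poll` instead of reading the clock inside it keeps the flush decision deterministic and easy to test.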

Structs§

BatchConfig
Configuration for the batch processor
BatchProcessor
Batch processor for high-throughput inference
BatchRequest
A single inference request
BatchStats
Statistics about batch processing

Enums§

BatchStrategy
Batch processing strategy