Skip to main content

Module scheduler

Module scheduler 

Source
Expand description

Scheduler - Continuous Batching Request Scheduler

Manages multiple inference requests with dynamic batching. Supports adding/removing requests mid-inference.

§Architecture

  • FIFO queue for pending requests
  • Active batch with configurable max size
  • Preemption support for priority requests (future)

Structs§

Batch
Batch of requests currently being processed
Request
Inference request
Scheduler
Continuous batching scheduler
SchedulerConfig
Scheduler configuration
SchedulerStats
Scheduler statistics

Enums§

Priority
Request priority (higher = more urgent)
RequestState
Request state

Type Aliases§

RequestId
Unique request identifier