Struct BatchConfig

Source

pub struct BatchConfig {
    pub max_batch_size: usize,
    pub max_latency: Duration,
    pub queue_capacity: Option<usize>,
    pub response_timeout: Option<Duration>,
    pub max_in_flight_per_feed: usize,
    pub startup_timeout: Option<Duration>,
}

Expand description

Configuration for a batch coordinator.

Controls batch formation: how many items accumulate before dispatch and how long to wait for a full batch.

§Tradeoffs

max_batch_size: Larger batches improve throughput (better GPU utilization) but increase per-frame latency because each frame waits for the batch to fill.
max_latency: Lower values reduce worst-case latency for partial batches but may dispatch smaller, less efficient batches.

Reasonable starting points for multi-feed inference:

max_batch_size: 4–16 (depends on GPU memory / model size)
max_latency: 20–100ms (depends on frame rate / latency tolerance)

Fields§

§max_batch_size: usize

Maximum items in a single batch.

When this many items accumulate, the batch is dispatched immediately without waiting for max_latency.

Must be ≥ 1.

§max_latency: Duration

Maximum time to wait for a full batch before dispatching a partial one.

After the first item arrives, the coordinator waits up to this duration for more items. If the batch is still not full when the deadline expires, it is dispatched as-is.

Must be > 0.

§queue_capacity: Option<usize>

Submission queue capacity.

Controls how many pending items can be buffered before submit_and_wait returns BatchSubmitError::QueueFull.

Defaults to max_batch_size * 4 (minimum 4) when None. When specified, must be ≥ max_batch_size.

§response_timeout: Option<Duration>

Safety timeout added beyond max_latency when a feed thread waits for a batch response.

The total wait is max_latency + response_timeout. This bounds how long a feed thread can block if the coordinator is wedged or processing is severely delayed.

In practice, responses arrive within max_latency + processing_time. This safety margin exists only to guarantee eventual unblocking.

Defaults to 5 seconds when None. Must be > 0 when specified.

§max_in_flight_per_feed: usize

Maximum number of in-flight submissions allowed per feed.

An item is “in-flight” from the moment it enters the submission queue until the coordinator routes its result back (or drains it at shutdown). When a feed reaches this limit, further submit_and_wait calls fail immediately with BatchSubmitError::InFlightCapReached rather than adding to the queue.

This prevents a feed from accumulating orphaned items in the shared queue after timeouts: when submit_and_wait times out, the item remains in-flight inside the coordinator. Without a cap, the feed could immediately submit another frame, stacking multiple items and crowding other feeds.

Default: 1 — each feed contributes at most one item to the shared queue at any time. Must be ≥ 1.

§startup_timeout: Option<Duration>

Maximum time to wait for BatchProcessor::on_start() to complete before returning an error.

GPU-backed processors (e.g. TensorRT engine compilation) may need significantly longer than CPU-only models. Set this to accommodate worst-case first-run warm-up on the target hardware.

Defaults to 30 seconds when None. Must be > 0 when specified.

Struct BatchConfig Copy item path

§Tradeoffs

Fields§

Implementations§

impl BatchConfig

pub fn new( max_batch_size: usize, max_latency: Duration, ) -> Result<Self, ConfigError>

§Errors

pub fn with_queue_capacity(self, capacity: Option<usize>) -> Self

pub fn with_response_timeout(self, timeout: Option<Duration>) -> Self

pub fn with_max_in_flight_per_feed(self, max: usize) -> Self

pub fn with_startup_timeout(self, timeout: Option<Duration>) -> Self

pub fn validate(&self) -> Result<(), ConfigError>

§Errors

Trait Implementations§

impl Clone for BatchConfig

fn clone(&self) -> BatchConfig

fn clone_from(&mut self, source: &Self)

impl Debug for BatchConfig

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Default for BatchConfig

fn default() -> Self

Auto Trait Implementations§

impl Freeze for BatchConfig

impl RefUnwindSafe for BatchConfig

impl Send for BatchConfig

impl Sync for BatchConfig

impl Unpin for BatchConfig

impl UnsafeUnpin for BatchConfig

impl UnwindSafe for BatchConfig

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

fn in_current_span(self) -> Instrumented<Self>

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>where F: FnOnce(&Self) -> bool,

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>where S: Into<Dispatch>,

fn with_current_subscriber(self) -> WithDispatch<Self>

Struct BatchConfig

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T, U> Into<U> for T
where U: From<T>,

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,