pub struct BatchHint {
pub max_batch_size: usize,
pub max_tokens: usize,
pub target_latency_ms: Option<u64>,
pub available_memory: Option<u64>,
pub resource_constraints: ResourceConstraints,
}Expand description
Batch hint for scheduler optimization
Fields§
§max_batch_size: usizeMaximum batch size
max_tokens: usizeMaximum total tokens in batch
target_latency_ms: Option<u64>Target latency for batch formation
available_memory: Option<u64>Available memory for batch
resource_constraints: ResourceConstraintsResource constraints
Implementations§
Trait Implementations§
Auto Trait Implementations§
impl Freeze for BatchHint
impl RefUnwindSafe for BatchHint
impl Send for BatchHint
impl Sync for BatchHint
impl Unpin for BatchHint
impl UnsafeUnpin for BatchHint
impl UnwindSafe for BatchHint
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more