pub struct BatchConfig {
    pub max_batch_size: usize,
    pub max_wait_ms: u64,
    pub enable_dynamic: bool,
    pub enable_continuous: bool,
    pub max_num_batched_tokens: usize,
}
Fields§
max_batch_size: usize
max_wait_ms: u64
enable_dynamic: bool
enable_continuous: bool
max_num_batched_tokens: usize
vLLM-style per-iteration token budget. The scheduler emits a
mixed prefill+decode batch summing to at most this many Q
tokens (each decode counts as 1, each prefill chunk as its chunk size).
Defaults to 2048. Runtime snapshots can override this with
FERRUM_MAX_BATCHED_TOKENS, usually set by the GPU autosizer or a
named workload preset rather than a hand-written env bundle.
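The budget rule above can be illustrated with a minimal sketch. It is not the crate's scheduler: the `Pending` enum, the `plan_iteration` function, and the decode-first ordering are all hypothetical, chosen only to show how decode steps (1 token each) and prefill chunks (their chunk size) can be packed under `max_num_batched_tokens`.

```rust
/// Hypothetical view of a queued request; not part of the documented API.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Pending {
    /// A running sequence that needs one decode step (costs 1 Q token).
    Decode,
    /// A sequence with `remaining` prompt tokens still to prefill.
    Prefill { remaining: usize },
}

/// Returns how many Q tokens each request gets this iteration (0 = deferred).
/// The sum of the returned values never exceeds `max_num_batched_tokens`.
pub fn plan_iteration(pending: &[Pending], max_num_batched_tokens: usize) -> Vec<usize> {
    let mut budget = max_num_batched_tokens;
    let mut plan = vec![0; pending.len()];

    // Assumption: decodes are admitted first, each costing exactly one token.
    for (i, p) in pending.iter().enumerate() {
        if budget == 0 {
            break;
        }
        if matches!(p, Pending::Decode) {
            plan[i] = 1;
            budget -= 1;
        }
    }

    // Leftover budget goes to prefill, truncating a chunk to what still fits.
    for (i, p) in pending.iter().enumerate() {
        if budget == 0 {
            break;
        }
        if let Pending::Prefill { remaining } = *p {
            let chunk = remaining.min(budget);
            plan[i] = chunk;
            budget -= chunk;
        }
    }

    plan
}
```

With the default budget of 2048, two decodes and a 3000-token prompt would yield two 1-token decode slots and a 2046-token prefill chunk under this sketch; the rest of the prompt, and any later prefill, waits for the next iteration.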
Trait Implementations§
impl Clone for BatchConfig
fn clone(&self) -> BatchConfig
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Since 1.0.0 (const: unstable). Performs copy-assignment from source.
impl Debug for BatchConfig
impl Default for BatchConfig
impl<'de> Deserialize<'de> for BatchConfig
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
Auto Trait Implementations§
impl Freeze for BatchConfig
impl RefUnwindSafe for BatchConfig
impl Send for BatchConfig
impl Sync for BatchConfig
impl Unpin for BatchConfig
impl UnsafeUnpin for BatchConfig
impl UnwindSafe for BatchConfig
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.