pub struct LoadedModel {
pub engine: InferenceEngine,
pub last_used: Instant,
pub mem_bytes: usize,
pub inflight: u64,
}Expand description
A single loaded model with its engine and bookkeeping data.
Fields§
§engine: InferenceEngineThe owned inference engine.
last_used: InstantMonotonic timestamp of the last request that used this model.
mem_bytes: usizeEstimated resident memory in bytes:
weights_size + max_batch * (kv_size_per_seq + state_size_per_seq).
Used by the pool to enforce the memory budget.
inflight: u64Number of requests currently using this model.
Trait Implementations§
Auto Trait Implementations§
impl !Freeze for LoadedModel
impl !RefUnwindSafe for LoadedModel
impl Send for LoadedModel
impl Sync for LoadedModel
impl Unpin for LoadedModel
impl UnsafeUnpin for LoadedModel
impl !UnwindSafe for LoadedModel
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more