pub enum RoutingWorkload {
    Interactive,
    Batch,
    Background,
    LocalPreferred,
}
How latency-sensitive this generation request is.
Variants
Interactive
User-facing, interactive request where latency matters.
Batch
Batch job where latency matters somewhat, but quality/cost matter more.
Background
Background or offline work where latency is a weak concern.
LocalPreferred
Caller explicitly prefers on-device models. This is distinct from
Background, which means “this is a background job, latency
barely matters”. The caller may be doing latency-sensitive
interactive work but wants the privacy, cost, or offline
properties of local inference. Uses the same local_bonus as
Background, plus a slightly more quality-aware weight profile:
the caller chose local for a reason, not because the work is
throwaway.
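The relationship between the variants can be sketched as a mapping from workload to scoring weights. This is a minimal illustration, not the crate's implementation: the `WeightProfile` struct, the `weights_for` function, and every number below are assumptions, and the enum is mirrored locally so the block compiles on its own. Only two properties come from the description above: LocalPreferred shares Background's local_bonus, and it weights quality slightly higher.

```rust
// Local mirror of the documented enum so the sketch compiles on its own.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum RoutingWorkload {
    Interactive,
    Batch,
    Background,
    LocalPreferred,
}

/// Hypothetical scoring weights; the struct and its fields are illustrative.
#[derive(Clone, Copy, Debug, PartialEq)]
pub struct WeightProfile {
    pub latency: f32,
    pub quality: f32,
    pub cost: f32,
    /// Flat bonus added to the score of on-device models.
    pub local_bonus: f32,
}

/// Maps a workload to a weight profile. All values are assumed, not the crate's.
pub fn weights_for(workload: RoutingWorkload) -> WeightProfile {
    match workload {
        // Latency dominates for user-facing requests.
        RoutingWorkload::Interactive => WeightProfile {
            latency: 0.6, quality: 0.3, cost: 0.1, local_bonus: 0.0,
        },
        // Latency matters somewhat; quality and cost matter more.
        RoutingWorkload::Batch => WeightProfile {
            latency: 0.2, quality: 0.4, cost: 0.4, local_bonus: 0.0,
        },
        // Latency is a weak concern; on-device models get a bonus.
        RoutingWorkload::Background => WeightProfile {
            latency: 0.05, quality: 0.35, cost: 0.6, local_bonus: 0.2,
        },
        // Same local_bonus as Background, slightly more weight on quality.
        RoutingWorkload::LocalPreferred => WeightProfile {
            latency: 0.05, quality: 0.45, cost: 0.5, local_bonus: 0.2,
        },
    }
}
```

Under these assumed numbers, a router would add `local_bonus` to the score of each on-device candidate, so Background and LocalPreferred lean toward local models by the same amount while LocalPreferred still favors the higher-quality option among them.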
Implementations
Trait Implementations
impl Clone for RoutingWorkload
fn clone(&self) -> RoutingWorkload
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for RoutingWorkload
impl Default for RoutingWorkload
fn default() -> RoutingWorkload
Returns the “default value” for a type.
impl<'de> Deserialize<'de> for RoutingWorkload
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
impl PartialEq for RoutingWorkload
fn eq(&self, other: &RoutingWorkload) -> bool
Tests for self and other values to be equal, and is used by ==.
impl Serialize for RoutingWorkload
impl Copy for RoutingWorkload
impl Eq for RoutingWorkload
impl StructuralPartialEq for RoutingWorkload
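Because the type is Copy, Eq, and Debug, callers can pass it by value, compare it with ==, and format it for logs. The sketch below shows this on a local mirror of the enum; the `describe` and `count_matching` helpers are hypothetical and not part of the crate, and the mirror assumes derived implementations.

```rust
// Local mirror of the documented enum with the same derivable traits.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum RoutingWorkload {
    Interactive,
    Batch,
    Background,
    LocalPreferred,
}

/// Hypothetical helper: takes the workload by value (cheap, since it is Copy)
/// and returns its Debug representation, e.g. for a log line.
pub fn describe(workload: RoutingWorkload) -> String {
    format!("{:?}", workload)
}

/// Hypothetical helper: counts how many queued requests carry the given workload,
/// using the == comparison provided by PartialEq.
pub fn count_matching(requests: &[RoutingWorkload], wanted: RoutingWorkload) -> usize {
    requests.iter().filter(|w| **w == wanted).count()
}
```

A value remains usable after being passed to either helper, since passing a Copy type by value does not move it.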
Auto Trait Implementations
impl Freeze for RoutingWorkload
impl RefUnwindSafe for RoutingWorkload
impl Send for RoutingWorkload
impl Sync for RoutingWorkload
impl Unpin for RoutingWorkload
impl UnsafeUnpin for RoutingWorkload
impl UnwindSafe for RoutingWorkload
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<Q, K> Equivalent<K> for Q
fn equivalent(&self, key: &K) -> bool
Compare self to key and return true if they are equal.
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.