pub struct LoadAwareMoERouter { /* private fields */ }
Load-aware MoE router
Routes to the least loaded available expert, with optional affinity for consistent routing when loads are similar.
Implementations
impl LoadAwareMoERouter
pub fn with_affinity_threshold(self, threshold: f64) -> Self
Set the affinity threshold.
If the load difference is below this threshold, the router prefers consistent routing over strictly picking the least loaded expert.
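The interplay between load and affinity can be sketched as follows. This is a minimal illustration, not the crate's implementation: the function `pick_expert`, the `loads` slice, the `preferred` index, and the rule of comparing the preferred expert's load against the minimum load are all assumptions.

```rust
/// Hypothetical helper: pick an expert index from per-expert loads.
/// `preferred` is the expert a consistent-routing scheme would choose.
fn pick_expert(loads: &[f64], preferred: usize, threshold: f64) -> Option<usize> {
    // Find the least loaded expert; returns `None` if there are no experts.
    let (min_idx, min_load) = loads
        .iter()
        .copied()
        .enumerate()
        .min_by(|a, b| a.1.total_cmp(&b.1))?;

    // If the preferred expert is within `threshold` of the minimum load,
    // keep routing to it for consistency; otherwise fall back to the minimum.
    match loads.get(preferred) {
        Some(&load) if load - min_load < threshold => Some(preferred),
        _ => Some(min_idx),
    }
}
```

Under these assumptions, a larger threshold favors sticking with the preferred expert, while a threshold of zero reduces to plain least-loaded routing.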
Trait Implementations
impl Clone for LoadAwareMoERouter
fn clone(&self) -> LoadAwareMoERouter
Returns a duplicate of the value.
1.0.0 · fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for LoadAwareMoERouter
impl Default for LoadAwareMoERouter
impl MoERouter for LoadAwareMoERouter
fn route<'life0, 'life1, 'async_trait>(
    &'life0 self,
    input: &'life1 str,
    num_experts: usize,
) -> Pin<Box<dyn Future<Output = RouteResult> + Send + 'async_trait>>
where
    Self: 'async_trait,
    'life0: 'async_trait,
    'life1: 'async_trait,
Route an input to an expert.
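The long signature is the shape that the `async_trait` macro generates for an `async fn` declared in a trait: the method returns a pinned, boxed future rather than being `async` itself. The sketch below writes that shape by hand using only the standard library. `SketchRouter`, `LenRouter`, the `RouteResult` alias, and `block_on` are hypothetical stand-ins for illustration, not the crate's types.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

// Hypothetical stand-in; the crate's real `RouteResult` is not shown here.
type RouteResult = Result<usize, String>;

// Hand-written equivalent of
// `async fn route(&self, input: &str, num_experts: usize) -> RouteResult`.
trait SketchRouter {
    fn route<'a>(
        &'a self,
        input: &'a str,
        num_experts: usize,
    ) -> Pin<Box<dyn Future<Output = RouteResult> + Send + 'a>>;
}

// Toy router (an assumption): picks an expert from the input length.
struct LenRouter;

impl SketchRouter for LenRouter {
    fn route<'a>(
        &'a self,
        input: &'a str,
        num_experts: usize,
    ) -> Pin<Box<dyn Future<Output = RouteResult> + Send + 'a>> {
        Box::pin(async move {
            if num_experts == 0 {
                return Err("no experts".to_string());
            }
            Ok(input.len() % num_experts)
        })
    }
}

// Waker that does nothing when woken.
struct NoopWake;
impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

// Minimal busy-polling executor; only suitable for futures that
// complete without waiting on external events.
fn block_on<F: Future>(fut: F) -> F::Output {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut fut = Box::pin(fut);
    loop {
        if let Poll::Ready(value) = fut.as_mut().poll(&mut cx) {
            return value;
        }
    }
}
```

In real code the future would normally be awaited inside an async runtime; `block_on` here only exists so the sketch runs without external crates.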
fn route_with_experts<'life0, 'life1, 'life2, 'async_trait>(
    &'life0 self,
    input: &'life1 str,
    experts: &'life2 [Expert],
) -> Pin<Box<dyn Future<Output = RouteResult> + Send + 'async_trait>>
where
    Self: 'async_trait,
    'life0: 'async_trait,
    'life1: 'async_trait,
    'life2: 'async_trait,
Route with expert health information.
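To illustrate how health information might narrow the candidate set before loads are compared, here is a small sketch. The `SketchExpert` struct and its `id`, `healthy`, and `load` fields are assumptions; the fields of the crate's `Expert` type are not documented in this section.

```rust
// Hypothetical stand-in for the crate's `Expert`; field names are assumed.
struct SketchExpert {
    id: usize,
    healthy: bool,
    load: f64,
}

/// Return the id of the least loaded healthy expert, if any.
fn least_loaded_healthy(experts: &[SketchExpert]) -> Option<usize> {
    experts
        .iter()
        // Unhealthy experts are never candidates, regardless of load.
        .filter(|e| e.healthy)
        .min_by(|a, b| a.load.total_cmp(&b.load))
        .map(|e| e.id)
}
```

Filtering before the comparison means an idle but unhealthy expert never wins over a busier healthy one.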
Auto Trait Implementations
impl Freeze for LoadAwareMoERouter
impl RefUnwindSafe for LoadAwareMoERouter
impl Send for LoadAwareMoERouter
impl Sync for LoadAwareMoERouter
impl Unpin for LoadAwareMoERouter
impl UnwindSafe for LoadAwareMoERouter
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.