pub struct InferenceProbeConfig {
pub endpoint: String,
pub model: String,
pub prompt: String,
pub max_tokens: u32,
pub timeout_secs: u64,
pub max_latency_ms: Option<u64>,
}Expand description
Configuration for inference probe health check
Sends a minimal completion request to verify the model can actually process requests, not just that the server is running.
Fields§
§endpoint: StringEndpoint for completion request
model: StringModel to probe (required)
prompt: StringProbe prompt (minimal to reduce cost/latency)
max_tokens: u32Max tokens in response (keep minimal)
timeout_secs: u64Timeout for probe request in seconds
max_latency_ms: Option<u64>Mark unhealthy if probe latency exceeds this threshold (ms)
Trait Implementations§
Source§impl Clone for InferenceProbeConfig
impl Clone for InferenceProbeConfig
Source§fn clone(&self) -> InferenceProbeConfig
fn clone(&self) -> InferenceProbeConfig
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for InferenceProbeConfig
impl Debug for InferenceProbeConfig
Source§impl<'de> Deserialize<'de> for InferenceProbeConfig
impl<'de> Deserialize<'de> for InferenceProbeConfig
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Source§impl PartialEq for InferenceProbeConfig
impl PartialEq for InferenceProbeConfig
Source§impl Serialize for InferenceProbeConfig
impl Serialize for InferenceProbeConfig
impl Eq for InferenceProbeConfig
impl StructuralPartialEq for InferenceProbeConfig
Auto Trait Implementations§
impl Freeze for InferenceProbeConfig
impl RefUnwindSafe for InferenceProbeConfig
impl Send for InferenceProbeConfig
impl Sync for InferenceProbeConfig
impl Unpin for InferenceProbeConfig
impl UnwindSafe for InferenceProbeConfig
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more