pub struct TgiConfig {
    pub endpoint: String,
    pub model_id: String,
    pub max_total_tokens: Option<u32>,
    pub max_input_length: Option<u32>,
    pub max_batch_prefill_tokens: Option<u32>,
    pub max_concurrent_requests: Option<u32>,
    pub max_batch_total_tokens: Option<u32>,
    pub timeout_seconds: u64,
}
Configuration for a TGI (Text Generation Inference) service.
Fields
endpoint: String
Service endpoint URL.
model_id: String
Model to serve (Hugging Face model ID).
max_total_tokens: Option<u32>
Maximum total number of tokens per request.
max_input_length: Option<u32>
Maximum input length, in tokens.
max_batch_prefill_tokens: Option<u32>
Maximum number of tokens in a prefill batch.
max_concurrent_requests: Option<u32>
Maximum number of concurrent requests.
max_batch_total_tokens: Option<u32>
Maximum total number of tokens in a batch.
timeout_seconds: u64
Request timeout, in seconds.
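As a minimal sketch of how these fields fit together, the example below fills in a TgiConfig and derives two values from it. The struct is mirrored locally so the block compiles on its own, and the derives on that mirror are an assumption. The endpoint URL, the model ID, the numeric limits, and the helpers example_config, request_timeout, and limits_consistent are all hypothetical and are not part of the documented API.

```rust
use std::time::Duration;

// Local mirror of the documented struct so this sketch is self-contained.
// In real code, import `TgiConfig` from the crate instead. The derives here
// are an assumption made for the example only.
#[derive(Debug, Clone)]
pub struct TgiConfig {
    pub endpoint: String,
    pub model_id: String,
    pub max_total_tokens: Option<u32>,
    pub max_input_length: Option<u32>,
    pub max_batch_prefill_tokens: Option<u32>,
    pub max_concurrent_requests: Option<u32>,
    pub max_batch_total_tokens: Option<u32>,
    pub timeout_seconds: u64,
}

// Hypothetical helper: builds a config with placeholder values.
pub fn example_config() -> TgiConfig {
    TgiConfig {
        endpoint: "http://localhost:8080".to_string(), // placeholder URL
        model_id: "example-org/example-model".to_string(), // placeholder ID
        max_total_tokens: Some(4096),
        max_input_length: Some(3072),
        max_batch_prefill_tokens: None, // left unset
        max_concurrent_requests: Some(128),
        max_batch_total_tokens: None, // left unset
        timeout_seconds: 60,
    }
}

// Hypothetical helper: converts the raw seconds field into a `Duration`.
pub fn request_timeout(cfg: &TgiConfig) -> Duration {
    Duration::from_secs(cfg.timeout_seconds)
}

// Hypothetical sanity check: when both limits are set, the input limit
// should leave room for at least one generated token.
pub fn limits_consistent(cfg: &TgiConfig) -> bool {
    match (cfg.max_input_length, cfg.max_total_tokens) {
        (Some(input), Some(total)) => input < total,
        _ => true, // nothing to compare if either limit is unset
    }
}

fn main() {
    let cfg = example_config();
    println!("{cfg:#?}");
    println!("timeout: {:?}", request_timeout(&cfg));
    println!("limits consistent: {}", limits_consistent(&cfg));
}
```

Every limit is an Option<u32>, so a caller can leave any of them unset. This page does not say how an unset limit is interpreted, so the sketch treats None as "nothing to check".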
Trait Implementations
Auto Trait Implementations
impl Freeze for TgiConfig
impl RefUnwindSafe for TgiConfig
impl Send for TgiConfig
impl Sync for TgiConfig
impl Unpin for TgiConfig
impl UnwindSafe for TgiConfig
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
impl<T> IntoRequest<T> for T
fn into_request(self) -> Request<T>
Wraps the input message T in a tonic::Request.