pub struct CandleConfig {
    pub source: String,
    pub local_path: String,
    pub filename: Option<String>,
    pub chat_template: String,
    pub device: String,
    pub embedding_repo: Option<String>,
    pub hf_token: Option<String>,
    pub generation: GenerationParams,
    pub inference_timeout_secs: u64,
}
Fields§
source: String
local_path: String
filename: Option<String>
chat_template: String
device: String
embedding_repo: Option<String>
hf_token: Option<String>
Resolved HuggingFace Hub API token for authenticated model downloads.
Must be the token value itself, resolved by the caller before constructing this config.
generation: GenerationParams
inference_timeout_secs: u64
Maximum seconds to wait for each half of a single inference request.
The timeout is applied twice per chat() call: once for the channel send
(waiting for a free slot) and once for the oneshot reply (waiting for the worker
to finish). The effective maximum wall-clock wait per request is therefore
2 × inference_timeout_secs. CPU inference can be slow; 120s is a conservative
default for large models, giving up to 240s total before an error is returned.
Values of 0 are silently promoted to 1 at bootstrap.
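The two documented rules above (the doubled timeout and the caller-resolved token) can be sketched as small standalone helpers. This is only an illustration under stated assumptions: effective_timeout_secs, worst_case_wait, and resolve_hf_token are hypothetical names that do not appear in this crate, and treating a blank token as absent is an assumption rather than documented behavior.

```rust
use std::time::Duration;

/// Hypothetical helper: mirrors the documented bootstrap rule that a
/// configured value of 0 is promoted to 1 second.
fn effective_timeout_secs(configured: u64) -> u64 {
    configured.max(1)
}

/// Worst-case wall-clock wait for one chat() call. The timeout is applied
/// once to the channel send and once to the oneshot reply, so the bound is
/// twice the effective per-phase timeout.
fn worst_case_wait(configured: u64) -> Duration {
    Duration::from_secs(effective_timeout_secs(configured).saturating_mul(2))
}

/// Hypothetical caller-side step: turn a raw lookup result into the literal
/// token value that `hf_token` expects. Blank values are treated as absent
/// (an assumption made for this sketch).
fn resolve_hf_token(raw: Option<String>) -> Option<String> {
    raw.map(|t| t.trim().to_string()).filter(|t| !t.is_empty())
}
```

A caller might pass the result of an environment lookup, for example resolve_hf_token(std::env::var("HF_TOKEN").ok()), and store the returned value in hf_token; the variable name used here is an assumption. With the default of 120, worst_case_wait(120) yields the 240s bound described above.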
Trait Implementations§
impl Debug for CandleConfig
impl<'de> Deserialize<'de> for CandleConfig
fn deserialize<__D>(
    __deserializer: __D,
) -> Result<CandleConfig, <__D as Deserializer<'de>>::Error>
where
    __D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer.
impl Serialize for CandleConfig
fn serialize<__S>(
    &self,
    __serializer: __S,
) -> Result<<__S as Serializer>::Ok, <__S as Serializer>::Error>
where
    __S: Serializer,
Serialize this value into the given Serde serializer.
Auto Trait Implementations§
impl Freeze for CandleConfig
impl RefUnwindSafe for CandleConfig
impl Send for CandleConfig
impl Sync for CandleConfig
impl Unpin for CandleConfig
impl UnsafeUnpin for CandleConfig
impl UnwindSafe for CandleConfig
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
impl<T> IntoRequest<T> for T
fn into_request(self) -> Request<T>
Wrap the input message T in a tonic::Request