pub struct Client {
pub inner: GrpcInferenceServiceClient<InterceptedService<Channel, AuthInterceptor>>,
}
Expand description
Triton Client
Fields§
§inner: GrpcInferenceServiceClient<InterceptedService<Channel, AuthInterceptor>>
Raw grpc client interfaces automatically generated by tonic
Should not necessary to use this interface directly in most cases
Implementations§
Source§impl Client
impl Client
Sourcepub async fn new(
url: impl TryInto<Uri, Error = InvalidUri>,
access_token: Option<String>,
) -> Result<Self, Error>
pub async fn new( url: impl TryInto<Uri, Error = InvalidUri>, access_token: Option<String>, ) -> Result<Self, Error>
Create a new triton client for the given url.
Sourcepub async fn server_live(&self) -> Result<ServerLiveResponse, Error>
pub async fn server_live(&self) -> Result<ServerLiveResponse, Error>
Check liveness of the inference server.
Sourcepub async fn server_ready(&self) -> Result<ServerReadyResponse, Error>
pub async fn server_ready(&self) -> Result<ServerReadyResponse, Error>
Check readiness of the inference server.
Sourcepub async fn model_ready(
&self,
req: ModelReadyRequest,
) -> Result<ModelReadyResponse, Error>
pub async fn model_ready( &self, req: ModelReadyRequest, ) -> Result<ModelReadyResponse, Error>
Check readiness of a model in the inference server.
Sourcepub async fn server_metadata(&self) -> Result<ServerMetadataResponse, Error>
pub async fn server_metadata(&self) -> Result<ServerMetadataResponse, Error>
Get server metadata.
Sourcepub async fn model_metadata(
&self,
req: ModelMetadataRequest,
) -> Result<ModelMetadataResponse, Error>
pub async fn model_metadata( &self, req: ModelMetadataRequest, ) -> Result<ModelMetadataResponse, Error>
Get model metadata.
Sourcepub async fn model_infer(
&self,
req: ModelInferRequest,
) -> Result<ModelInferResponse, Error>
pub async fn model_infer( &self, req: ModelInferRequest, ) -> Result<ModelInferResponse, Error>
Perform inference using a specific model.
Sourcepub async fn model_config(
&self,
req: ModelConfigRequest,
) -> Result<ModelConfigResponse, Error>
pub async fn model_config( &self, req: ModelConfigRequest, ) -> Result<ModelConfigResponse, Error>
Get model configuration.
Sourcepub async fn model_statistics(
&self,
req: ModelStatisticsRequest,
) -> Result<ModelStatisticsResponse, Error>
pub async fn model_statistics( &self, req: ModelStatisticsRequest, ) -> Result<ModelStatisticsResponse, Error>
Get the cumulative inference statistics for a model.
Sourcepub async fn repository_index(
&self,
req: RepositoryIndexRequest,
) -> Result<RepositoryIndexResponse, Error>
pub async fn repository_index( &self, req: RepositoryIndexRequest, ) -> Result<RepositoryIndexResponse, Error>
Get the index of model repository contents.
Sourcepub async fn repository_model_load(
&self,
req: RepositoryModelLoadRequest,
) -> Result<RepositoryModelLoadResponse, Error>
pub async fn repository_model_load( &self, req: RepositoryModelLoadRequest, ) -> Result<RepositoryModelLoadResponse, Error>
Load or reload a model from a repository.
Sourcepub async fn repository_model_unload(
&self,
req: RepositoryModelUnloadRequest,
) -> Result<RepositoryModelUnloadResponse, Error>
pub async fn repository_model_unload( &self, req: RepositoryModelUnloadRequest, ) -> Result<RepositoryModelUnloadResponse, Error>
Unload a model.
Get the status of all registered system-shared-memory regions.
Register a system-shared-memory region.
Unregister a system-shared-memory region.
Get the status of all registered CUDA-shared-memory regions.
Register a CUDA-shared-memory region.
Unregister a CUDA-shared-memory region.
Sourcepub async fn trace_setting(
&self,
req: TraceSettingRequest,
) -> Result<TraceSettingResponse, Error>
pub async fn trace_setting( &self, req: TraceSettingRequest, ) -> Result<TraceSettingResponse, Error>
Update and get the trace setting of the Triton server.
Trait Implementations§
Auto Trait Implementations§
impl !Freeze for Client
impl !RefUnwindSafe for Client
impl Send for Client
impl Sync for Client
impl Unpin for Client
impl !UnwindSafe for Client
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoRequest<T> for T
impl<T> IntoRequest<T> for T
Source§fn into_request(self) -> Request<T>
fn into_request(self) -> Request<T>
T
in a tonic::Request