#[non_exhaustive]
pub struct PendingProductionVariantSummary { pub variant_name: Option<String>, pub deployed_images: Option<Vec<DeployedImage>>, pub current_weight: Option<f32>, pub desired_weight: Option<f32>, pub current_instance_count: Option<i32>, pub desired_instance_count: Option<i32>, pub instance_type: Option<ProductionVariantInstanceType>, pub accelerator_type: Option<ProductionVariantAcceleratorType>, pub variant_status: Option<Vec<ProductionVariantStatus>>, pub current_serverless_config: Option<ProductionVariantServerlessConfig>, pub desired_serverless_config: Option<ProductionVariantServerlessConfig>, }
Expand description

The production variant summary for a deployment when an endpoint is creating or updating with the CreateEndpoint or UpdateEndpoint operations. Describes the VariantStatus , weight and capacity for a production variant associated with an endpoint.

Fields (Non-exhaustive)

This struct is marked as non-exhaustive
Non-exhaustive structs could have additional fields added in future. Therefore, non-exhaustive structs cannot be constructed in external crates using the traditional Struct { .. } syntax; cannot be matched against without a wildcard ..; and struct update syntax will not work.
variant_name: Option<String>

The name of the variant.

deployed_images: Option<Vec<DeployedImage>>

An array of DeployedImage objects that specify the Amazon EC2 Container Registry paths of the inference images deployed on instances of this ProductionVariant.

current_weight: Option<f32>

The weight associated with the variant.

desired_weight: Option<f32>

The requested weight for the variant in this deployment, as specified in the endpoint configuration for the endpoint. The value is taken from the request to the CreateEndpointConfig operation.

current_instance_count: Option<i32>

The number of instances associated with the variant.

desired_instance_count: Option<i32>

The number of instances requested in this deployment, as specified in the endpoint configuration for the endpoint. The value is taken from the request to the CreateEndpointConfig operation.

instance_type: Option<ProductionVariantInstanceType>

The type of instances associated with the variant.

accelerator_type: Option<ProductionVariantAcceleratorType>

The size of the Elastic Inference (EI) instance to use for the production variant. EI instances provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon SageMaker.

variant_status: Option<Vec<ProductionVariantStatus>>

The endpoint variant status which describes the current deployment stage status or operational status.

current_serverless_config: Option<ProductionVariantServerlessConfig>

The serverless configuration for the endpoint.

Serverless Inference is in preview release for Amazon SageMaker and is subject to change. We do not recommend using this feature in production environments.

desired_serverless_config: Option<ProductionVariantServerlessConfig>

The serverless configuration requested for this deployment, as specified in the endpoint configuration for the endpoint.

Serverless Inference is in preview release for Amazon SageMaker and is subject to change. We do not recommend using this feature in production environments.

Implementations

The name of the variant.

An array of DeployedImage objects that specify the Amazon EC2 Container Registry paths of the inference images deployed on instances of this ProductionVariant.

The weight associated with the variant.

The requested weight for the variant in this deployment, as specified in the endpoint configuration for the endpoint. The value is taken from the request to the CreateEndpointConfig operation.

The number of instances associated with the variant.

The number of instances requested in this deployment, as specified in the endpoint configuration for the endpoint. The value is taken from the request to the CreateEndpointConfig operation.

The type of instances associated with the variant.

The size of the Elastic Inference (EI) instance to use for the production variant. EI instances provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon SageMaker.

The endpoint variant status which describes the current deployment stage status or operational status.

The serverless configuration for the endpoint.

Serverless Inference is in preview release for Amazon SageMaker and is subject to change. We do not recommend using this feature in production environments.

The serverless configuration requested for this deployment, as specified in the endpoint configuration for the endpoint.

Serverless Inference is in preview release for Amazon SageMaker and is subject to change. We do not recommend using this feature in production environments.

Creates a new builder-style object to manufacture PendingProductionVariantSummary

Trait Implementations

Returns a copy of the value. Read more

Performs copy-assignment from source. Read more

Formats the value using the given formatter. Read more

This method tests for self and other values to be equal, and is used by ==. Read more

This method tests for !=.

Auto Trait Implementations

Blanket Implementations

Gets the TypeId of self. Read more

Immutably borrows from an owned value. Read more

Mutably borrows from an owned value. Read more

Returns the argument unchanged.

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

The resulting type after obtaining ownership.

Creates owned data from borrowed data, usually by cloning. Read more

🔬 This is a nightly-only experimental API. (toowned_clone_into)

Uses borrowed data to replace owned data, usually by cloning. Read more

The type returned in the event of a conversion error.

Performs the conversion.

The type returned in the event of a conversion error.

Performs the conversion.

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more