ProductionVariantSummary
Describes weight and capacities for a production variant associated with an endpoint. If you sent a request to the UpdateEndpointWeightsAndCapacities API and the endpoint status is Updating, you get different desired and current values.
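As a minimal sketch of how the desired and current values might be compared, the following Python function takes one variant summary as a plain dictionary. It assumes the PascalCase key names used in the DescribeEndpoint JSON response (for example `CurrentWeight` and `DesiredWeight`); the function name `pending_changes` and the sample data are hypothetical and only illustrate the idea.

```python
from typing import Any, Dict, List


def pending_changes(variant: Dict[str, Any]) -> List[str]:
    """Return labels of settings whose desired value differs from the current one.

    `variant` is assumed to be one ProductionVariantSummary in the shape of the
    DescribeEndpoint JSON response (PascalCase keys).
    """
    pairs = [
        ("Weight", "CurrentWeight", "DesiredWeight"),
        ("InstanceCount", "CurrentInstanceCount", "DesiredInstanceCount"),
        ("ServerlessConfig", "CurrentServerlessConfig", "DesiredServerlessConfig"),
    ]
    changed = []
    for label, current_key, desired_key in pairs:
        # Only compare a pair when both sides are present in the summary.
        if current_key in variant and desired_key in variant:
            if variant[current_key] != variant[desired_key]:
                changed.append(label)
    return changed


# Hypothetical summary for an endpoint that is still applying a weight update.
example = {
    "VariantName": "variant-a",
    "CurrentWeight": 1.0,
    "DesiredWeight": 3.0,
    "CurrentInstanceCount": 2,
    "DesiredInstanceCount": 2,
}
print(pending_changes(example))
```

An empty result suggests that the variant has reached its requested weight and capacity; a non-empty result lists the settings that are still converging.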
Types
Properties
Settings for the capacity reservation for the compute instances that SageMaker AI reserves for an endpoint.
The number of instances associated with the variant.
The serverless configuration for the endpoint.
The weight associated with the variant.
An array of DeployedImage objects that specify the Amazon EC2 Container Registry paths of the inference images deployed on instances of this ProductionVariant.
The number of instances requested in the UpdateEndpointWeightsAndCapacities request.
The serverless configuration requested for the endpoint update.
The requested weight, as specified in the UpdateEndpointWeightsAndCapacities request.
Settings that control the range in the number of instances that the endpoint provisions as it scales up or down to accommodate traffic.
Settings that control how the endpoint routes incoming traffic to the instances that the endpoint hosts.
The name of the variant.
The endpoint variant status, which describes the current deployment stage or operational status.
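The weight of a variant is relative: each variant receives traffic in proportion to its weight divided by the sum of the weights of all variants on the endpoint. The sketch below illustrates that arithmetic, assuming a list of variant summaries as plain dictionaries with the `VariantName` and `CurrentWeight` keys from the DescribeEndpoint JSON response. The function name `traffic_shares` and the sample values are hypothetical.

```python
from typing import Any, Dict, List


def traffic_shares(variants: List[Dict[str, Any]]) -> Dict[str, float]:
    """Map each variant name to its fraction of traffic, based on current weights."""
    total = sum(v.get("CurrentWeight", 0.0) for v in variants)
    if total == 0:
        # No weight assigned anywhere, so no share can be computed.
        return {v["VariantName"]: 0.0 for v in variants}
    return {v["VariantName"]: v.get("CurrentWeight", 0.0) / total for v in variants}


# Hypothetical endpoint with two variants weighted 1 and 3.
shares = traffic_shares([
    {"VariantName": "variant-a", "CurrentWeight": 1.0},
    {"VariantName": "variant-b", "CurrentWeight": 3.0},
])
print(shares)
```

Using the desired weight instead of the current weight in the same calculation would estimate the traffic split after a pending update completes.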