InferenceComponentCapacitySize
Specifies the type and size of the endpoint capacity to activate for a rolling deployment or a rollback strategy. You can specify your batches as either of the following:
A count of inference component copies
The overall percentage of your fleet
For a rollback strategy, if you don't specify the fields in this object, or if you set the Value parameter to 100%, then SageMaker AI uses a blue/green rollback strategy and rolls all traffic back to the blue fleet.
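The two ways of sizing a batch described above might be expressed as in the following sketch. It assumes the builder exposes `type` and `value` properties and that the enum `InferenceComponentCapacitySizeType` has `CopyCount` and `CapacityPercent` members. Check these names against the Properties listing of your SDK version. The helper functions `copyCountBatch` and `percentBatch` are hypothetical and exist only for illustration.

```kotlin
import aws.sdk.kotlin.services.sagemaker.model.InferenceComponentCapacitySize
import aws.sdk.kotlin.services.sagemaker.model.InferenceComponentCapacitySizeType

// Hypothetical helper: a batch sized as a count of inference component copies.
// Assumption: the builder exposes `type` and `value`.
fun copyCountBatch(copies: Int): InferenceComponentCapacitySize =
    InferenceComponentCapacitySize {
        type = InferenceComponentCapacitySizeType.CopyCount
        value = copies
    }

// Hypothetical helper: a batch sized as a percentage of the fleet.
fun percentBatch(percent: Int): InferenceComponentCapacitySize =
    InferenceComponentCapacitySize {
        type = InferenceComponentCapacitySizeType.CapacityPercent
        value = percent
    }

fun main() {
    println(copyCountBatch(2))  // two copies per batch
    println(percentBatch(25))   // 25 percent of the fleet per batch
}
```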
Types
Properties
Functions
inline fun copy(block: InferenceComponentCapacitySize.Builder.() -> Unit = {}): InferenceComponentCapacitySize
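As a minimal sketch of how `copy` might be used: the block runs against a builder pre-populated from the existing instance, so only the fields assigned inside the block change and the original object is left untouched. The property names `type` and `value` and the enum member `CapacityPercent` are assumptions to verify against your SDK version, and `withValue` is a hypothetical helper.

```kotlin
import aws.sdk.kotlin.services.sagemaker.model.InferenceComponentCapacitySize
import aws.sdk.kotlin.services.sagemaker.model.InferenceComponentCapacitySizeType

// Hypothetical helper: returns a new instance with only `value` replaced.
fun withValue(base: InferenceComponentCapacitySize, newValue: Int): InferenceComponentCapacitySize =
    base.copy { value = newValue }

fun main() {
    val quarter = InferenceComponentCapacitySize {
        type = InferenceComponentCapacitySizeType.CapacityPercent
        value = 25
    }

    // Only `value` is overridden; `type` carries over from `quarter`.
    val half = withValue(quarter, 50)

    println(quarter)
    println(half)
}
```

Calling `copy()` with no block returns a new instance with the same field values, since the default block is empty.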