Interface ScalingPolicyMetric.Builder
- All Superinterfaces:
Buildable, CopyableBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>, SdkBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>, SdkPojo
- Enclosing class:
ScalingPolicyMetric
@Mutable
@NotThreadSafe
public static interface ScalingPolicyMetric.Builder
extends SdkPojo, CopyableBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>
-
Method Summary
Modifier and Type  Method  Description
ScalingPolicyMetric.Builder  invocationsPerInstance(Integer invocationsPerInstance)  The number of invocations sent to a model, normalized by InstanceCount in each ProductionVariant.
ScalingPolicyMetric.Builder  modelLatency(Integer modelLatency)  The interval of time taken by a model to respond as viewed from SageMaker.
Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFieldNameToField, sdkFields
-
Method Details
-
invocationsPerInstance
The number of invocations sent to a model, normalized by InstanceCount in each ProductionVariant. 1/numberOfInstances is sent as the value on each request, where numberOfInstances is the number of active instances for the ProductionVariant behind the endpoint at the time of the request.
- Parameters:
invocationsPerInstance - The number of invocations sent to a model, normalized by InstanceCount in each ProductionVariant. 1/numberOfInstances is sent as the value on each request, where numberOfInstances is the number of active instances for the ProductionVariant behind the endpoint at the time of the request.
- Returns:
Returns a reference to this object so that method calls can be chained together.
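The normalization described above can be illustrated with a small arithmetic sketch. It is not part of the SDK: the class and method names are hypothetical, and it assumes the per-request values are added up over some observation window, which the description does not state explicitly.

```java
public class InvocationsPerInstanceSketch {

    /** Value attributed to one request: 1 / numberOfInstances. */
    public static double perRequestValue(int numberOfInstances) {
        if (numberOfInstances <= 0) {
            throw new IllegalArgumentException("numberOfInstances must be positive");
        }
        return 1.0 / numberOfInstances;
    }

    /**
     * Adds the per-request values for a window of requests.
     * Each array element is the number of active instances seen by that request.
     * Summing is an assumption made for this illustration.
     */
    public static double sumOverWindow(int[] activeInstancesPerRequest) {
        double sum = 0.0;
        for (int n : activeInstancesPerRequest) {
            sum += perRequestValue(n);
        }
        return sum;
    }

    public static void main(String[] args) {
        // Eight requests arrive while four instances are active,
        // so each request contributes 0.25.
        int[] window = {4, 4, 4, 4, 4, 4, 4, 4};
        System.out.println(sumOverWindow(window)); // prints 2.0
    }
}
```

Under that assumption, eight requests spread over four instances come out as two invocations per instance, which is the kind of whole-number figure the Integer parameter would carry.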
-
modelLatency
The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
- Parameters:
modelLatency - The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
- Returns:
Returns a reference to this object so that method calls can be chained together.
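As a rough illustration of what the interval covers, the sketch below adds the three components named in the description. The class, method, and sample durations are hypothetical, and the unit of the Integer parameter is not stated on this page, so the example works with java.time.Duration rather than assuming one.

```java
import java.time.Duration;

public class ModelLatencySketch {

    /**
     * Adds the three components named in the description:
     * sending the request to the container, running inference,
     * and fetching the response from the container.
     */
    public static Duration modelLatency(Duration sendRequest,
                                        Duration inference,
                                        Duration fetchResponse) {
        return sendRequest.plus(inference).plus(fetchResponse);
    }

    public static void main(String[] args) {
        // Illustrative values only; they are not measurements.
        Duration total = modelLatency(
                Duration.ofMillis(2),
                Duration.ofMillis(45),
                Duration.ofMillis(3));
        System.out.println(total.toMillis()); // prints 50
    }
}
```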
-