Builder
class Builder
Properties
Link copied to clipboard
Stops a load test when the number of invocations (TPS) peaks and flattens, which means that the instance has reached capacity. The default value is Stop
. If you want the load test to continue after invocations have flattened, set the value to Continue
.
Link copied to clipboard
The maximum number of requests per minute expected for the endpoint.
Link copied to clipboard
The interval of time taken by a model to respond as viewed from SageMaker. The interval includes the local communication time taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.