Builder

class Builder

Properties

Controls whether a load test stops when the number of invocations (TPS) peaks and flattens, which indicates that the instance has reached capacity. The default value is Stop. To let the load test continue after invocations have flattened, set the value to Continue.

The maximum number of requests per minute expected for the endpoint.

The time a model takes to respond, as viewed from SageMaker. This interval includes the local communication time needed to send the request to the model's container and fetch the response, plus the time taken to complete the inference in the container.
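
The property names are not shown on this page, so the following is only a minimal, self-contained sketch of how a builder with these three settings might be used. Every name in it (`StoppingConditions`, `FlatInvocationsPolicy`, `LatencyThreshold`, `maxInvocationsPerMinute`, and so on) is a hypothetical stand-in defined locally for illustration, not the SDK's actual API. The millisecond unit and the `"P95"` percentile label are likewise assumptions.

```kotlin
// Hypothetical stand-ins for illustration only; these are NOT the SDK's types.
enum class FlatInvocationsPolicy { Stop, Continue }

// Assumed shape: a percentile label plus a latency limit in milliseconds.
data class LatencyThreshold(val percentile: String, val valueInMilliseconds: Int)

class StoppingConditions private constructor(builder: Builder) {
    val flatInvocations: FlatInvocationsPolicy = builder.flatInvocations
    val maxInvocationsPerMinute: Int? = builder.maxInvocationsPerMinute
    val latencyThresholds: List<LatencyThreshold> = builder.latencyThresholds.toList()

    class Builder {
        // Defaults to Stop, mirroring the default described above.
        var flatInvocations: FlatInvocationsPolicy = FlatInvocationsPolicy.Stop

        // Maximum number of requests per minute expected for the endpoint.
        var maxInvocationsPerMinute: Int? = null

        // Response-time limits as observed from the service side.
        var latencyThresholds: List<LatencyThreshold> = emptyList()

        fun build(): StoppingConditions = StoppingConditions(this)
    }

    companion object {
        // DSL-style entry point: configure a Builder in a lambda, then build.
        operator fun invoke(block: Builder.() -> Unit): StoppingConditions =
            Builder().apply(block).build()
    }
}

fun main() {
    val conditions = StoppingConditions {
        // Keep the load test running after invocations flatten.
        flatInvocations = FlatInvocationsPolicy.Continue
        maxInvocationsPerMinute = 300
        latencyThresholds = listOf(
            LatencyThreshold(percentile = "P95", valueInMilliseconds = 100)
        )
    }
    println(conditions.flatInvocations)
    println(conditions.maxInvocationsPerMinute)
}
```

In this sketch, leaving a property unset keeps its default, so a configuration that never touches `flatInvocations` would stop the load test once invocations flatten.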