Interface AsyncInferenceClientConfig.Builder

  • Method Details

    • maxConcurrentInvocationsPerInstance

      AsyncInferenceClientConfig.Builder maxConcurrentInvocationsPerInstance(Integer maxConcurrentInvocationsPerInstance)

      The maximum number of concurrent requests sent by the SageMaker client to the model container. If no value is provided, SageMaker chooses an optimal value.

      Parameters:
      maxConcurrentInvocationsPerInstance - The maximum number of concurrent requests sent by the SageMaker client to the model container. If no value is provided, SageMaker chooses an optimal value.
      Returns:
      A reference to this builder, so that method calls can be chained together.
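
Example (a minimal sketch, not SDK code): the classes below are hypothetical stand-ins that mirror only the builder shape documented here, so the snippet compiles without the AWS SDK on the classpath. `ClientConfigSketch` and `BuilderChainingDemo` are invented names. The sketch illustrates the two behaviors described above: the setter returns the builder so calls can be chained, and leaving the value unset keeps it `null`, which is assumed here to be the state in which SageMaker chooses a value itself.

```java
// Demo entry point; placed first so single-file launch (java File.java) runs it.
class BuilderChainingDemo {
    public static void main(String[] args) {
        // Explicit limit: at most 4 concurrent requests per instance.
        ClientConfigSketch explicit = ClientConfigSketch.builder()
                .maxConcurrentInvocationsPerInstance(4)
                .build();
        System.out.println(explicit.maxConcurrentInvocationsPerInstance()); // prints 4

        // No value provided: the field stays null (service-chosen value assumed).
        ClientConfigSketch unset = ClientConfigSketch.builder().build();
        System.out.println(unset.maxConcurrentInvocationsPerInstance()); // prints null
    }
}

// Hypothetical stand-in for the SDK model class; NOT the real AsyncInferenceClientConfig.
final class ClientConfigSketch {
    private final Integer maxConcurrentInvocationsPerInstance;

    private ClientConfigSketch(Builder builder) {
        this.maxConcurrentInvocationsPerInstance = builder.maxConcurrentInvocationsPerInstance;
    }

    // Returns the configured limit, or null when none was provided.
    Integer maxConcurrentInvocationsPerInstance() {
        return maxConcurrentInvocationsPerInstance;
    }

    static Builder builder() {
        return new Builder();
    }

    static final class Builder {
        // Boxed Integer so that "not set" can be represented as null.
        private Integer maxConcurrentInvocationsPerInstance;

        // Returns this builder so that further setter calls can be chained.
        Builder maxConcurrentInvocationsPerInstance(Integer maxConcurrentInvocationsPerInstance) {
            this.maxConcurrentInvocationsPerInstance = maxConcurrentInvocationsPerInstance;
            return this;
        }

        ClientConfigSketch build() {
            return new ClientConfigSketch(this);
        }
    }
}
```

With the SDK itself, the equivalent call chain would be `AsyncInferenceClientConfig.builder().maxConcurrentInvocationsPerInstance(4).build()`. The parameter is a boxed `Integer` rather than an `int`, which is what allows the "no value provided" case to be expressed at all.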