class Aws::SageMaker::Types::AsyncInferenceClientConfig
Configures the behavior of the client used by Amazon SageMaker
to interact with the model container during asynchronous inference.
@note When making an API call, you may pass AsyncInferenceClientConfig
data as a hash: { max_concurrent_invocations_per_instance: 1, }
@!attribute [rw] max_concurrent_invocations_per_instance
The maximum number of concurrent requests sent by the SageMaker client to the model container. If no value is provided, Amazon SageMaker will choose an optimal value for you. @return [Integer]
@see docs.aws.amazon.com/goto/WebAPI/sagemaker-2017-07-24/AsyncInferenceClientConfig AWS API Documentation
Constants
- SENSITIVE