class Aws::SageMaker::Types::TransformJobDefinition

Defines the input needed to run a transform job using the inference specification specified in the algorithm.

@note When making an API call, you may pass TransformJobDefinition

data as a hash:

    {
      max_concurrent_transforms: 1,
      max_payload_in_mb: 1,
      batch_strategy: "MultiRecord", # accepts MultiRecord, SingleRecord
      environment: {
        "TransformEnvironmentKey" => "TransformEnvironmentValue",
      },
      transform_input: { # required
        data_source: { # required
          s3_data_source: { # required
            s3_data_type: "ManifestFile", # required, accepts ManifestFile, S3Prefix, AugmentedManifestFile
            s3_uri: "S3Uri", # required
          },
        },
        content_type: "ContentType",
        compression_type: "None", # accepts None, Gzip
        split_type: "None", # accepts None, Line, RecordIO, TFRecord
      },
      transform_output: { # required
        s3_output_path: "S3Uri", # required
        accept: "Accept",
        assemble_with: "None", # accepts None, Line
        kms_key_id: "KmsKeyId",
      },
      transform_resources: { # required
        instance_type: "ml.m4.xlarge", # required, accepts ml.m4.xlarge, ml.m4.2xlarge, ml.m4.4xlarge, ml.m4.10xlarge, ml.m4.16xlarge, ml.c4.xlarge, ml.c4.2xlarge, ml.c4.4xlarge, ml.c4.8xlarge, ml.p2.xlarge, ml.p2.8xlarge, ml.p2.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.c5.xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.c5.18xlarge, ml.m5.large, ml.m5.xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge
        instance_count: 1, # required
        volume_kms_key_id: "KmsKeyId",
      },
    }

@!attribute [rw] max_concurrent_transforms

The maximum number of parallel requests that can be sent to each
instance in a transform job. The default value is 1.
@return [Integer]

@!attribute [rw] max_payload_in_mb

The maximum payload size allowed, in MB. A payload is the data
portion of a record (without metadata).
@return [Integer]

@!attribute [rw] batch_strategy

A string that determines the number of records included in a single
mini-batch.

`SingleRecord` means only one record is used per mini-batch.
`MultiRecord` means a mini-batch is set to contain as many records
that can fit within the `MaxPayloadInMB` limit.
@return [String]

@!attribute [rw] environment

The environment variables to set in the Docker container. We support
up to 16 key and values entries in the map.
@return [Hash<String,String>]

@!attribute [rw] transform_input

A description of the input source and the way the transform job
consumes it.
@return [Types::TransformInput]

@!attribute [rw] transform_output

Identifies the Amazon S3 location where you want Amazon SageMaker to
save the results from the transform job.
@return [Types::TransformOutput]

@!attribute [rw] transform_resources

Identifies the ML compute instances for the transform job.
@return [Types::TransformResources]

@see docs.aws.amazon.com/goto/WebAPI/sagemaker-2017-07-24/TransformJobDefinition AWS API Documentation

Constants

SENSITIVE