class Aws::SageMaker::Types::ProcessingInput

The inputs for a processing job. The processing input must specify exactly one of either `S3Input` or `DatasetDefinition` types.

@note When making an API call, you may pass ProcessingInput

data as a hash:

    {
      input_name: "String", # required
      app_managed: false,
      s3_input: {
        s3_uri: "S3Uri", # required
        local_path: "ProcessingLocalPath",
        s3_data_type: "ManifestFile", # required, accepts ManifestFile, S3Prefix
        s3_input_mode: "Pipe", # accepts Pipe, File
        s3_data_distribution_type: "FullyReplicated", # accepts FullyReplicated, ShardedByS3Key
        s3_compression_type: "None", # accepts None, Gzip
      },
      dataset_definition: {
        athena_dataset_definition: {
          catalog: "AthenaCatalog", # required
          database: "AthenaDatabase", # required
          query_string: "AthenaQueryString", # required
          work_group: "AthenaWorkGroup",
          output_s3_uri: "S3Uri", # required
          kms_key_id: "KmsKeyId",
          output_format: "PARQUET", # required, accepts PARQUET, ORC, AVRO, JSON, TEXTFILE
          output_compression: "GZIP", # accepts GZIP, SNAPPY, ZLIB
        },
        redshift_dataset_definition: {
          cluster_id: "RedshiftClusterId", # required
          database: "RedshiftDatabase", # required
          db_user: "RedshiftUserName", # required
          query_string: "RedshiftQueryString", # required
          cluster_role_arn: "RoleArn", # required
          output_s3_uri: "S3Uri", # required
          kms_key_id: "KmsKeyId",
          output_format: "PARQUET", # required, accepts PARQUET, CSV
          output_compression: "None", # accepts None, GZIP, BZIP2, ZSTD, SNAPPY
        },
        local_path: "ProcessingLocalPath",
        data_distribution_type: "FullyReplicated", # accepts FullyReplicated, ShardedByS3Key
        input_mode: "Pipe", # accepts Pipe, File
      },
    }

@!attribute [rw] input_name

The name for the processing job input.
@return [String]

@!attribute [rw] app_managed

When `True`, input operations such as data download are managed
natively by the processing job application. When `False` (default),
input operations are managed by Amazon SageMaker.
@return [Boolean]

@!attribute [rw] s3_input

Configuration for downloading input data from Amazon S3 into the
processing container.
@return [Types::ProcessingS3Input]

@!attribute [rw] dataset_definition

Configuration for a Dataset Definition input.
@return [Types::DatasetDefinition]

@see docs.aws.amazon.com/goto/WebAPI/sagemaker-2017-07-24/ProcessingInput AWS API Documentation

Constants

SENSITIVE