class Aws::Glue::Types::CreateMLTransformRequest
@note When making an API call, you may pass CreateMLTransformRequest
data as a hash: { name: "NameString", # required description: "DescriptionString", input_record_tables: [ # required { database_name: "NameString", # required table_name: "NameString", # required catalog_id: "NameString", connection_name: "NameString", }, ], parameters: { # required transform_type: "FIND_MATCHES", # required, accepts FIND_MATCHES find_matches_parameters: { primary_key_column_name: "ColumnNameString", precision_recall_tradeoff: 1.0, accuracy_cost_tradeoff: 1.0, enforce_provided_labels: false, }, }, role: "RoleString", # required glue_version: "GlueVersionString", max_capacity: 1.0, worker_type: "Standard", # accepts Standard, G.1X, G.2X number_of_workers: 1, timeout: 1, max_retries: 1, tags: { "TagKey" => "TagValue", }, transform_encryption: { ml_user_data_encryption: { ml_user_data_encryption_mode: "DISABLED", # required, accepts DISABLED, SSE-KMS kms_key_id: "NameString", }, task_run_security_configuration_name: "NameString", }, }
@!attribute [rw] name
The unique name that you give the transform when you create it. @return [String]
@!attribute [rw] description
A description of the machine learning transform that is being defined. The default is an empty string. @return [String]
@!attribute [rw] input_record_tables
A list of Glue table definitions used by the transform. @return [Array<Types::GlueTable>]
@!attribute [rw] parameters
The algorithmic parameters that are specific to the transform type used. Conditionally dependent on the transform type. @return [Types::TransformParameters]
@!attribute [rw] role
The name or Amazon Resource Name (ARN) of the IAM role with the required permissions. The required permissions include both Glue service role permissions to Glue resources, and Amazon S3 permissions required by the transform. * This role needs Glue service role permissions to allow access to resources in Glue. See [Attach a Policy to IAM Users That Access Glue][1]. * This role needs permission to your Amazon Simple Storage Service (Amazon S3) sources, targets, temporary directory, scripts, and any libraries used by the task run for this transform. [1]: https://docs.aws.amazon.com/glue/latest/dg/attach-policy-iam-user.html @return [String]
@!attribute [rw] glue_version
This value determines which version of Glue this machine learning transform is compatible with. Glue 1.0 is recommended for most customers. If the value is not set, the Glue compatibility defaults to Glue 0.9. For more information, see [Glue Versions][1] in the developer guide. [1]: https://docs.aws.amazon.com/glue/latest/dg/release-notes.html#release-notes-versions @return [String]
@!attribute [rw] max_capacity
The number of Glue data processing units (DPUs) that are allocated to task runs for this transform. You can allocate from 2 to 100 DPUs; the default is 10. A DPU is a relative measure of processing power that consists of 4 vCPUs of compute capacity and 16 GB of memory. For more information, see the [Glue pricing page][1]. `MaxCapacity` is a mutually exclusive option with `NumberOfWorkers` and `WorkerType`. * If either `NumberOfWorkers` or `WorkerType` is set, then `MaxCapacity` cannot be set. * If `MaxCapacity` is set then neither `NumberOfWorkers` or `WorkerType` can be set. * If `WorkerType` is set, then `NumberOfWorkers` is required (and vice versa). * `MaxCapacity` and `NumberOfWorkers` must both be at least 1. When the `WorkerType` field is set to a value other than `Standard`, the `MaxCapacity` field is set automatically and becomes read-only. When the `WorkerType` field is set to a value other than `Standard`, the `MaxCapacity` field is set automatically and becomes read-only. [1]: https://aws.amazon.com/glue/pricing/ @return [Float]
@!attribute [rw] worker_type
The type of predefined worker that is allocated when this task runs. Accepts a value of Standard, G.1X, or G.2X. * For the `Standard` worker type, each worker provides 4 vCPU, 16 GB of memory and a 50GB disk, and 2 executors per worker. * For the `G.1X` worker type, each worker provides 4 vCPU, 16 GB of memory and a 64GB disk, and 1 executor per worker. * For the `G.2X` worker type, each worker provides 8 vCPU, 32 GB of memory and a 128GB disk, and 1 executor per worker. `MaxCapacity` is a mutually exclusive option with `NumberOfWorkers` and `WorkerType`. * If either `NumberOfWorkers` or `WorkerType` is set, then `MaxCapacity` cannot be set. * If `MaxCapacity` is set then neither `NumberOfWorkers` or `WorkerType` can be set. * If `WorkerType` is set, then `NumberOfWorkers` is required (and vice versa). * `MaxCapacity` and `NumberOfWorkers` must both be at least 1. @return [String]
@!attribute [rw] number_of_workers
The number of workers of a defined `workerType` that are allocated when this task runs. If `WorkerType` is set, then `NumberOfWorkers` is required (and vice versa). @return [Integer]
@!attribute [rw] timeout
The timeout of the task run for this transform in minutes. This is the maximum time that a task run for this transform can consume resources before it is terminated and enters `TIMEOUT` status. The default is 2,880 minutes (48 hours). @return [Integer]
@!attribute [rw] max_retries
The maximum number of times to retry a task for this transform after a task run fails. @return [Integer]
@!attribute [rw] tags
The tags to use with this machine learning transform. You may use tags to limit access to the machine learning transform. For more information about tags in Glue, see [Amazon Web Services Tags in Glue][1] in the developer guide. [1]: https://docs.aws.amazon.com/glue/latest/dg/monitor-tags.html @return [Hash<String,String>]
@!attribute [rw] transform_encryption
The encryption-at-rest settings of the transform that apply to accessing user data. Machine learning transforms can access user data encrypted in Amazon S3 using KMS. @return [Types::TransformEncryption]
@see docs.aws.amazon.com/goto/WebAPI/glue-2017-03-31/CreateMLTransformRequest AWS API Documentation
Constants
- SENSITIVE