NAME
Paws::SageMaker::CreateProcessingJob - Arguments for method CreateProcessingJob on Paws::SageMaker
DESCRIPTION
This class represents the parameters used for calling the method CreateProcessingJob on the Amazon SageMaker service. Use the attributes of this class as arguments to the method CreateProcessingJob.
You shouldn't make instances of this class. Each attribute should be used as a named argument in the call to CreateProcessingJob.
SYNOPSIS
my $sagemaker = Paws->service('SageMaker');
my $CreateProcessingJobResponse = $sagemaker->CreateProcessingJob(
AppSpecification => {
ImageUri => 'MyImageUri', # max: 255
ContainerArguments => [
'MyContainerArgument', ... # max: 256
], # min: 1, max: 100; OPTIONAL
ContainerEntrypoint => [
'MyContainerEntrypointString', ... # max: 256
], # min: 1, max: 100; OPTIONAL
},
ProcessingJobName => 'MyProcessingJobName',
ProcessingResources => {
ClusterConfig => {
InstanceCount => 1, # min: 1, max: 100
InstanceType => 'ml.t3.medium', # values: ml.t3.medium, ml.t3.large, ml.t3.xlarge, ml.t3.2xlarge, ml.m4.xlarge, ml.m4.2xlarge, ml.m4.4xlarge, ml.m4.10xlarge, ml.m4.16xlarge, ml.c4.xlarge, ml.c4.2xlarge, ml.c4.4xlarge, ml.c4.8xlarge, ml.p2.xlarge, ml.p2.8xlarge, ml.p2.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.c5.xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.c5.18xlarge, ml.m5.large, ml.m5.xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.r5.large, ml.r5.xlarge, ml.r5.2xlarge, ml.r5.4xlarge, ml.r5.8xlarge, ml.r5.12xlarge, ml.r5.16xlarge, ml.r5.24xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge
VolumeSizeInGB => 1, # min: 1, max: 16384
VolumeKmsKeyId => 'MyKmsKeyId', # max: 2048; OPTIONAL
},
},
RoleArn => 'MyRoleArn',
Environment => {
'MyProcessingEnvironmentKey' =>
'MyProcessingEnvironmentValue', # key: max: 256, value: max: 256
}, # OPTIONAL
ExperimentConfig => {
ExperimentName => 'MyExperimentEntityName', # min: 1, max: 120; OPTIONAL
TrialComponentDisplayName =>
'MyExperimentEntityName', # min: 1, max: 120; OPTIONAL
TrialName => 'MyExperimentEntityName', # min: 1, max: 120; OPTIONAL
}, # OPTIONAL
NetworkConfig => {
EnableInterContainerTrafficEncryption => 1, # OPTIONAL
EnableNetworkIsolation => 1, # OPTIONAL
VpcConfig => {
SecurityGroupIds => [
'MySecurityGroupId', ... # max: 32
], # min: 1, max: 5
Subnets => [
'MySubnetId', ... # max: 32
], # min: 1, max: 16
}, # OPTIONAL
}, # OPTIONAL
ProcessingInputs => [
{
InputName => 'MyString',
AppManaged => 1, # OPTIONAL
DatasetDefinition => {
AthenaDatasetDefinition => {
Catalog => 'MyAthenaCatalog', # min: 1, max: 256
Database => 'MyAthenaDatabase', # min: 1, max: 255
OutputFormat =>
'PARQUET', # values: PARQUET, ORC, AVRO, JSON, TEXTFILE
OutputS3Uri => 'MyS3Uri', # max: 1024
QueryString => 'MyAthenaQueryString', # min: 1, max: 4096
KmsKeyId => 'MyKmsKeyId', # max: 2048; OPTIONAL
OutputCompression =>
'GZIP', # values: GZIP, SNAPPY, ZLIB; OPTIONAL
WorkGroup => 'MyAthenaWorkGroup', # min: 1, max: 128; OPTIONAL
}, # OPTIONAL
DataDistributionType => 'FullyReplicated', # values: FullyReplicated, ShardedByS3Key; OPTIONAL
InputMode => 'Pipe', # values: Pipe, File; OPTIONAL
LocalPath => 'MyProcessingLocalPath', # max: 256; OPTIONAL
RedshiftDatasetDefinition => {
ClusterId => 'MyRedshiftClusterId', # min: 1, max: 63
ClusterRoleArn => 'MyRoleArn', # min: 20, max: 2048
Database => 'MyRedshiftDatabase', # min: 1, max: 64
DbUser => 'MyRedshiftUserName', # min: 1, max: 128
OutputFormat => 'PARQUET', # values: PARQUET, CSV
OutputS3Uri => 'MyS3Uri', # max: 1024
QueryString => 'MyRedshiftQueryString', # min: 1, max: 4096
KmsKeyId => 'MyKmsKeyId', # max: 2048; OPTIONAL
OutputCompression =>
'None', # values: None, GZIP, BZIP2, ZSTD, SNAPPY; OPTIONAL
}, # OPTIONAL
}, # OPTIONAL
S3Input => {
S3DataType => 'ManifestFile', # values: ManifestFile, S3Prefix
S3Uri => 'MyS3Uri', # max: 1024
LocalPath => 'MyProcessingLocalPath', # max: 256; OPTIONAL
S3CompressionType => 'None', # values: None, Gzip; OPTIONAL
S3DataDistributionType => 'FullyReplicated', # values: FullyReplicated, ShardedByS3Key; OPTIONAL
S3InputMode => 'Pipe', # values: Pipe, File; OPTIONAL
}, # OPTIONAL
},
...
], # OPTIONAL
ProcessingOutputConfig => {
Outputs => [
{
OutputName => 'MyString',
AppManaged => 1, # OPTIONAL
FeatureStoreOutput => {
FeatureGroupName => 'MyFeatureGroupName', # min: 1, max: 64
}, # OPTIONAL
S3Output => {
LocalPath => 'MyProcessingLocalPath', # max: 256; OPTIONAL
S3UploadMode => 'Continuous', # values: Continuous, EndOfJob
S3Uri => 'MyS3Uri', # max: 1024
}, # OPTIONAL
},
...
], # max: 10
KmsKeyId => 'MyKmsKeyId', # max: 2048; OPTIONAL
}, # OPTIONAL
StoppingCondition => {
MaxRuntimeInSeconds => 1, # min: 1, max: 604800
}, # OPTIONAL
Tags => [
{
Key => 'MyTagKey', # min: 1, max: 128
Value => 'MyTagValue', # max: 256
},
...
], # OPTIONAL
);
# Results:
my $ProcessingJobArn = $CreateProcessingJobResponse->ProcessingJobArn;
# Returns a Paws::SageMaker::CreateProcessingJobResponse object.
Values for attributes that are native types (Int, String, Float, etc.) can be passed as-is (scalar values). Values for complex types (objects) can be passed as a HashRef. The keys and values of the hashref will be used to instantiate the underlying object. For the AWS API documentation, see https://docs.aws.amazon.com/goto/WebAPI/api.sagemaker/CreateProcessingJob
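The SYNOPSIS lists every possible argument, but only the four arguments marked REQUIRED under ATTRIBUTES are needed for a call. The following is a minimal sketch of building just those arguments as plain hashrefs. The helper name minimal_processing_job_args, the job name, the role ARN, the image URI and the region are all hypothetical placeholders, not values taken from this documentation. The actual service call is left as a comment because it needs Paws installed and AWS credentials configured.

```perl
use strict;
use warnings;

# Hypothetical helper: builds only the four REQUIRED arguments.
# Nested structures are plain hashrefs, as described above.
sub minimal_processing_job_args {
    my (%opt) = @_;
    return (
        ProcessingJobName   => $opt{name},
        RoleArn             => $opt{role_arn},
        AppSpecification    => { ImageUri => $opt{image_uri} },
        ProcessingResources => {
            ClusterConfig => {
                InstanceCount  => $opt{instance_count} // 1,
                InstanceType   => $opt{instance_type}  // 'ml.t3.medium',
                VolumeSizeInGB => $opt{volume_gb}      // 1,
            },
        },
    );
}

# Placeholder values only; substitute your own.
my %args = minimal_processing_job_args(
    name      => 'my-processing-job',
    role_arn  => 'arn:aws:iam::123456789012:role/MyProcessingRole',
    image_uri => '123456789012.dkr.ecr.us-east-1.amazonaws.com/my-image:latest',
);

# With Paws installed and credentials configured, the call would look like:
#   use Paws;
#   my $sagemaker = Paws->service('SageMaker', region => 'us-east-1');
#   my $response  = $sagemaker->CreateProcessingJob(%args);
#   print $response->ProcessingJobArn, "\n";
```

Keeping the argument construction in its own function makes it possible to inspect or test the structure without contacting the service.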
ATTRIBUTES
REQUIRED AppSpecification => Paws::SageMaker::AppSpecification
Configures the processing job to run a specified Docker container image.
Environment => Paws::SageMaker::ProcessingEnvironmentMap
The environment variables to set in the Docker container. Up to 100 key-value entries in the map are supported.
ExperimentConfig => Paws::SageMaker::ExperimentConfig
NetworkConfig => Paws::SageMaker::NetworkConfig
Networking options for a processing job, such as whether to allow inbound and outbound network calls to and from processing containers, and the VPC subnets and security groups to use for VPC-enabled processing jobs.
ProcessingInputs => ArrayRef[Paws::SageMaker::ProcessingInput]
An array of inputs configuring the data to download into the processing container.
REQUIRED ProcessingJobName => Str
The name of the processing job. The name must be unique within an Amazon Web Services Region in the Amazon Web Services account.
ProcessingOutputConfig => Paws::SageMaker::ProcessingOutputConfig
Output configuration for the processing job.
REQUIRED ProcessingResources => Paws::SageMaker::ProcessingResources
Identifies the resources, ML compute instances, and ML storage volumes to deploy for a processing job. In distributed training, you specify more than one instance.
REQUIRED RoleArn => Str
The Amazon Resource Name (ARN) of an IAM role that Amazon SageMaker can assume to perform tasks on your behalf.
StoppingCondition => Paws::SageMaker::ProcessingStoppingCondition
The time limit for how long the processing job is allowed to run.
Tags => ArrayRef[Paws::SageMaker::Tag]
(Optional) An array of key-value pairs. For more information, see Using Cost Allocation Tags (https://docs.aws.amazon.com/awsaccountbilling/latest/aboutv2/cost-alloc-tags.html#allocation-whatURL) in the Amazon Web Services Billing and Cost Management User Guide.
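Some of the limits quoted in this document (the REQUIRED markers, the min/max comments in the SYNOPSIS, and the 100-entry limit on Environment) can be checked locally before a request is sent. The following is a rough sketch of such a check; check_processing_args is a hypothetical helper and not part of Paws, it covers only a handful of the documented limits, and the service itself remains the authority on what is valid.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Paws): returns a list of problems
# found in the arguments intended for CreateProcessingJob.
sub check_processing_args {
    my (%args) = @_;
    my @problems;

    # Arguments marked REQUIRED in the ATTRIBUTES section.
    for my $required (qw(AppSpecification ProcessingJobName ProcessingResources RoleArn)) {
        push @problems, "missing required argument $required"
            unless defined $args{$required};
    }

    # Numeric ranges taken from the SYNOPSIS comments for ClusterConfig.
    my $cluster = ($args{ProcessingResources} || {})->{ClusterConfig} || {};
    my %range = (
        InstanceCount  => [1, 100],
        VolumeSizeInGB => [1, 16384],
    );
    for my $field (sort keys %range) {
        my $value = $cluster->{$field};
        next unless defined $value;
        my ($min, $max) = @{ $range{$field} };
        push @problems, "$field must be between $min and $max"
            if $value < $min || $value > $max;
    }

    # Environment: up to 100 entries are supported.
    my $env = $args{Environment} || {};
    push @problems, 'Environment supports at most 100 entries'
        if scalar(keys %$env) > 100;

    # StoppingCondition: MaxRuntimeInSeconds is documented as 1 to 604800.
    my $stop = $args{StoppingCondition} || {};
    if (defined $stop->{MaxRuntimeInSeconds}) {
        my $seconds = $stop->{MaxRuntimeInSeconds};
        push @problems, 'MaxRuntimeInSeconds must be between 1 and 604800'
            if $seconds < 1 || $seconds > 604800;
    }

    return @problems;
}

# Example: an InstanceCount outside the documented range is reported.
my @found = check_processing_args(
    AppSpecification    => { ImageUri => 'MyImageUri' },
    ProcessingJobName   => 'MyProcessingJobName',
    RoleArn             => 'MyRoleArn',
    ProcessingResources => {
        ClusterConfig => {
            InstanceCount  => 101,
            InstanceType   => 'ml.t3.medium',
            VolumeSizeInGB => 1,
        },
    },
);
print "$_\n" for @found;
```

Returning a list of messages rather than dying on the first problem lets a caller report everything that is wrong in one pass.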
SEE ALSO
This class forms part of Paws, documenting arguments for method CreateProcessingJob in Paws::SageMaker
BUGS and CONTRIBUTIONS
The source code is located here: https://github.com/pplu/aws-sdk-perl
Please report bugs to: https://github.com/pplu/aws-sdk-perl/issues