NAME

Paws::SageMaker::CreateProcessingJob - Arguments for method CreateProcessingJob on Paws::SageMaker

DESCRIPTION

This class represents the parameters used for calling the method CreateProcessingJob on the Amazon SageMaker Service service. Use the attributes of this class as arguments to method CreateProcessingJob.

You shouldn't make instances of this class. Each attribute should be used as a named argument in the call to CreateProcessingJob.

SYNOPSIS

my $api.sagemaker = Paws->service('SageMaker');
my $CreateProcessingJobResponse = $api . sagemaker->CreateProcessingJob(
  AppSpecification => {
    ImageUri           => 'MyImageUri',    # max: 255
    ContainerArguments => [
      'MyContainerArgument', ...           # max: 256
    ],    # min: 1, max: 100; OPTIONAL
    ContainerEntrypoint => [
      'MyContainerEntrypointString', ...    # max: 256
    ],    # min: 1, max: 100; OPTIONAL
  },
  ProcessingJobName   => 'MyProcessingJobName',
  ProcessingResources => {
    ClusterConfig => {
      InstanceCount => 1,                # min: 1, max: 100
      InstanceType  => 'ml.t3.medium'
      , # values: ml.t3.medium, ml.t3.large, ml.t3.xlarge, ml.t3.2xlarge, ml.m4.xlarge, ml.m4.2xlarge, ml.m4.4xlarge, ml.m4.10xlarge, ml.m4.16xlarge, ml.c4.xlarge, ml.c4.2xlarge, ml.c4.4xlarge, ml.c4.8xlarge, ml.p2.xlarge, ml.p2.8xlarge, ml.p2.16xlarge, ml.p3.2xlarge, ml.p3.8xlarge, ml.p3.16xlarge, ml.c5.xlarge, ml.c5.2xlarge, ml.c5.4xlarge, ml.c5.9xlarge, ml.c5.18xlarge, ml.m5.large, ml.m5.xlarge, ml.m5.2xlarge, ml.m5.4xlarge, ml.m5.12xlarge, ml.m5.24xlarge, ml.r5.large, ml.r5.xlarge, ml.r5.2xlarge, ml.r5.4xlarge, ml.r5.8xlarge, ml.r5.12xlarge, ml.r5.16xlarge, ml.r5.24xlarge, ml.g4dn.xlarge, ml.g4dn.2xlarge, ml.g4dn.4xlarge, ml.g4dn.8xlarge, ml.g4dn.12xlarge, ml.g4dn.16xlarge
      VolumeSizeInGB => 1,               # min: 1, max: 16384
      VolumeKmsKeyId => 'MyKmsKeyId',    # max: 2048; OPTIONAL
    },

  },
  RoleArn     => 'MyRoleArn',
  Environment => {
    'MyProcessingEnvironmentKey' =>
      'MyProcessingEnvironmentValue',    # key: max: 256, value: max: 256
  },    # OPTIONAL
  ExperimentConfig => {
    ExperimentName => 'MyExperimentEntityName', # min: 1, max: 120; OPTIONAL
    TrialComponentDisplayName =>
      'MyExperimentEntityName',                 # min: 1, max: 120; OPTIONAL
    TrialName => 'MyExperimentEntityName',      # min: 1, max: 120; OPTIONAL
  },    # OPTIONAL
  NetworkConfig => {
    EnableInterContainerTrafficEncryption => 1,    # OPTIONAL
    EnableNetworkIsolation                => 1,    # OPTIONAL
    VpcConfig                             => {
      SecurityGroupIds => [
        'MySecurityGroupId', ...                   # max: 32
      ],    # min: 1, max: 5
      Subnets => [
        'MySubnetId', ...    # max: 32
      ],    # min: 1, max: 16

    },    # OPTIONAL
  },    # OPTIONAL
  ProcessingInputs => [
    {
      InputName         => 'MyString',
      AppManaged        => 1,            # OPTIONAL
      DatasetDefinition => {
        AthenaDatasetDefinition => {
          Catalog      => 'MyAthenaCatalog',     # min: 1, max: 256
          Database     => 'MyAthenaDatabase',    # min: 1, max: 255
          OutputFormat =>
            'PARQUET',    # values: PARQUET, ORC, AVRO, JSON, TEXTFILE
          OutputS3Uri       => 'MyS3Uri',              # max: 1024
          QueryString       => 'MyAthenaQueryString',  # min: 1, max: 4096
          KmsKeyId          => 'MyKmsKeyId',           # max: 2048; OPTIONAL
          OutputCompression =>
            'GZIP',    # values: GZIP, SNAPPY, ZLIB; OPTIONAL
          WorkGroup => 'MyAthenaWorkGroup',    # min: 1, max: 128; OPTIONAL
        },    # OPTIONAL
        DataDistributionType => 'FullyReplicated'
        ,     # values: FullyReplicated, ShardedByS3Key; OPTIONAL
        InputMode => 'Pipe',                  # values: Pipe, File; OPTIONAL
        LocalPath => 'MyProcessingLocalPath', # max: 256; OPTIONAL
        RedshiftDatasetDefinition => {
          ClusterId      => 'MyRedshiftClusterId',    # min: 1, max: 63
          ClusterRoleArn => 'MyRoleArn',              # min: 20, max: 2048
          Database       => 'MyRedshiftDatabase',     # min: 1, max: 64
          DbUser         => 'MyRedshiftUserName',     # min: 1, max: 128
          OutputFormat   => 'PARQUET',                # values: PARQUET, CSV
          OutputS3Uri    => 'MyS3Uri',                # max: 1024
          QueryString    => 'MyRedshiftQueryString',  # min: 1, max: 4096
          KmsKeyId       => 'MyKmsKeyId',             # max: 2048; OPTIONAL
          OutputCompression =>
            'None',    # values: None, GZIP, BZIP2, ZSTD, SNAPPY; OPTIONAL
        },    # OPTIONAL
      },    # OPTIONAL
      S3Input => {
        S3DataType => 'ManifestFile',    # values: ManifestFile, S3Prefix
        S3Uri      => 'MyS3Uri',         # max: 1024
        LocalPath  => 'MyProcessingLocalPath',    # max: 256; OPTIONAL
        S3CompressionType      => 'None',    # values: None, Gzip; OPTIONAL
        S3DataDistributionType => 'FullyReplicated'
        ,    # values: FullyReplicated, ShardedByS3Key; OPTIONAL
        S3InputMode => 'Pipe',    # values: Pipe, File; OPTIONAL
      },    # OPTIONAL
    },
    ...
  ],    # OPTIONAL
  ProcessingOutputConfig => {
    Outputs => [
      {
        OutputName         => 'MyString',
        AppManaged         => 1,            # OPTIONAL
        FeatureStoreOutput => {
          FeatureGroupName => 'MyFeatureGroupName',    # min: 1, max: 64

        },    # OPTIONAL
        S3Output => {
          LocalPath    => 'MyProcessingLocalPath',    # max: 256; OPTIONAL
          S3UploadMode => 'Continuous',    # values: Continuous, EndOfJob
          S3Uri        => 'MyS3Uri',       # max: 1024

        },    # OPTIONAL
      },
      ...
    ],    # max: 10
    KmsKeyId => 'MyKmsKeyId',    # max: 2048; OPTIONAL
  },    # OPTIONAL
  StoppingCondition => {
    MaxRuntimeInSeconds => 1,    # min: 1, max: 604800

  },    # OPTIONAL
  Tags => [
    {
      Key   => 'MyTagKey',      # min: 1, max: 128
      Value => 'MyTagValue',    # max: 256

    },
    ...
  ],    # OPTIONAL
);

# Results:
my $ProcessingJobArn = $CreateProcessingJobResponse->ProcessingJobArn;

# Returns a L<Paws::SageMaker::CreateProcessingJobResponse> object.

Values for attributes that are native types (Int, String, Float, etc) can passed as-is (scalar values). Values for complex Types (objects) can be passed as a HashRef. The keys and values of the hashref will be used to instance the underlying object. For the AWS API documentation, see https://docs.aws.amazon.com/goto/WebAPI/api.sagemaker/CreateProcessingJob

ATTRIBUTES

REQUIRED AppSpecification => Paws::SageMaker::AppSpecification

Configures the processing job to run a specified Docker container image.

Environment => Paws::SageMaker::ProcessingEnvironmentMap

The environment variables to set in the Docker container. Up to 100 key and values entries in the map are supported.

ExperimentConfig => Paws::SageMaker::ExperimentConfig

NetworkConfig => Paws::SageMaker::NetworkConfig

Networking options for a processing job, such as whether to allow inbound and outbound network calls to and from processing containers, and the VPC subnets and security groups to use for VPC-enabled processing jobs.

ProcessingInputs => ArrayRef[Paws::SageMaker::ProcessingInput]

An array of inputs configuring the data to download into the processing container.

REQUIRED ProcessingJobName => Str

The name of the processing job. The name must be unique within an Amazon Web Services Region in the Amazon Web Services account.

ProcessingOutputConfig => Paws::SageMaker::ProcessingOutputConfig

Output configuration for the processing job.

REQUIRED ProcessingResources => Paws::SageMaker::ProcessingResources

Identifies the resources, ML compute instances, and ML storage volumes to deploy for a processing job. In distributed training, you specify more than one instance.

REQUIRED RoleArn => Str

The Amazon Resource Name (ARN) of an IAM role that Amazon SageMaker can assume to perform tasks on your behalf.

StoppingCondition => Paws::SageMaker::ProcessingStoppingCondition

The time limit for how long the processing job is allowed to run.

Tags => ArrayRef[Paws::SageMaker::Tag]

(Optional) An array of key-value pairs. For more information, see Using Cost Allocation Tags (https://docs.aws.amazon.com/awsaccountbilling/latest/aboutv2/cost-alloc-tags.html#allocation-whatURL) in the Amazon Web Services Billing and Cost Management User Guide.

SEE ALSO

This class forms part of Paws, documenting arguments for method CreateProcessingJob in Paws::SageMaker

BUGS and CONTRIBUTIONS

The source code is located here: https://github.com/pplu/aws-sdk-perl

Please report bugs to: https://github.com/pplu/aws-sdk-perl/issues