NAME
Paws::Glue - Perl Interface to AWS AWS Glue
SYNOPSIS
use Paws;
my $obj = Paws->service('Glue');
my $res = $obj->Method(
Arg1 => $val1,
Arg2 => [ 'V1', 'V2' ],
# if Arg3 is an object, the HashRef will be used as arguments to the constructor
# of the arguments type
Arg3 => { Att1 => 'Val1' },
# if Arg4 is an array of objects, the HashRefs will be passed as arguments to
# the constructor of the arguments type
Arg4 => [ { Att1 => 'Val1' }, { Att1 => 'Val2' } ],
);
DESCRIPTION
Glue
Defines the public endpoint for the Glue service.
For the AWS API documentation, see https://docs.aws.amazon.com/goto/WebAPI/glue-2017-03-31
METHODS
BatchCreatePartition
- DatabaseName => Str
- PartitionInputList => ArrayRef[Paws::Glue::PartitionInput]
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::BatchCreatePartition
Returns: a Paws::Glue::BatchCreatePartitionResponse instance
Creates one or more partitions in a batch operation.
BatchDeleteConnection
Each argument is described in detail in: Paws::Glue::BatchDeleteConnection
Returns: a Paws::Glue::BatchDeleteConnectionResponse instance
Deletes a list of connection definitions from the Data Catalog.
BatchDeletePartition
- DatabaseName => Str
- PartitionsToDelete => ArrayRef[Paws::Glue::PartitionValueList]
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::BatchDeletePartition
Returns: a Paws::Glue::BatchDeletePartitionResponse instance
Deletes one or more partitions in a batch operation.
BatchDeleteTable
Each argument is described in detail in: Paws::Glue::BatchDeleteTable
Returns: a Paws::Glue::BatchDeleteTableResponse instance
Deletes multiple tables at once.
After completing this operation, you no longer have access to the table versions and partitions that belong to the deleted table. Glue deletes these "orphaned" resources asynchronously in a timely manner, at the discretion of the service.
To ensure the immediate deletion of all related resources, before calling BatchDeleteTable
, use DeleteTableVersion
or BatchDeleteTableVersion
, and DeletePartition
or BatchDeletePartition
, to delete any resources that belong to the table.
BatchDeleteTableVersion
Each argument is described in detail in: Paws::Glue::BatchDeleteTableVersion
Returns: a Paws::Glue::BatchDeleteTableVersionResponse instance
Deletes a specified batch of versions of a table.
BatchGetCrawlers
Each argument is described in detail in: Paws::Glue::BatchGetCrawlers
Returns: a Paws::Glue::BatchGetCrawlersResponse instance
Returns a list of resource metadata for a given list of crawler names. After calling the ListCrawlers
operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that uses tags.
BatchGetDevEndpoints
Each argument is described in detail in: Paws::Glue::BatchGetDevEndpoints
Returns: a Paws::Glue::BatchGetDevEndpointsResponse instance
Returns a list of resource metadata for a given list of development endpoint names. After calling the ListDevEndpoints
operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that uses tags.
BatchGetJobs
Each argument is described in detail in: Paws::Glue::BatchGetJobs
Returns: a Paws::Glue::BatchGetJobsResponse instance
Returns a list of resource metadata for a given list of job names. After calling the ListJobs
operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that uses tags.
BatchGetPartition
- DatabaseName => Str
- PartitionsToGet => ArrayRef[Paws::Glue::PartitionValueList]
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::BatchGetPartition
Returns: a Paws::Glue::BatchGetPartitionResponse instance
Retrieves partitions in a batch request.
BatchGetTriggers
Each argument is described in detail in: Paws::Glue::BatchGetTriggers
Returns: a Paws::Glue::BatchGetTriggersResponse instance
Returns a list of resource metadata for a given list of trigger names. After calling the ListTriggers
operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that uses tags.
BatchGetWorkflows
Each argument is described in detail in: Paws::Glue::BatchGetWorkflows
Returns: a Paws::Glue::BatchGetWorkflowsResponse instance
Returns a list of resource metadata for a given list of workflow names. After calling the ListWorkflows
operation, you can call this operation to access the data to which you have been granted permissions. This operation supports all IAM permissions, including permission conditions that uses tags.
BatchStopJobRun
Each argument is described in detail in: Paws::Glue::BatchStopJobRun
Returns: a Paws::Glue::BatchStopJobRunResponse instance
Stops one or more job runs for a specified job definition.
BatchUpdatePartition
- DatabaseName => Str
- Entries => ArrayRef[Paws::Glue::BatchUpdatePartitionRequestEntry]
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::BatchUpdatePartition
Returns: a Paws::Glue::BatchUpdatePartitionResponse instance
Updates one or more partitions in a batch operation.
CancelMLTaskRun
Each argument is described in detail in: Paws::Glue::CancelMLTaskRun
Returns: a Paws::Glue::CancelMLTaskRunResponse instance
Cancels (stops) a task run. Machine learning task runs are asynchronous tasks that Glue runs on your behalf as part of various machine learning workflows. You can cancel a machine learning task run at any time by calling CancelMLTaskRun
with a task run's parent transform's TransformID
and the task run's TaskRunId
.
CheckSchemaVersionValidity
Each argument is described in detail in: Paws::Glue::CheckSchemaVersionValidity
Returns: a Paws::Glue::CheckSchemaVersionValidityResponse instance
Validates the supplied schema. This call has no side effects, it simply validates using the supplied schema using DataFormat
as the format. Since it does not take a schema set name, no compatibility checks are performed.
CreateClassifier
- [CsvClassifier => Paws::Glue::CreateCsvClassifierRequest]
- [GrokClassifier => Paws::Glue::CreateGrokClassifierRequest]
- [JsonClassifier => Paws::Glue::CreateJsonClassifierRequest]
- [XMLClassifier => Paws::Glue::CreateXMLClassifierRequest]
Each argument is described in detail in: Paws::Glue::CreateClassifier
Returns: a Paws::Glue::CreateClassifierResponse instance
Creates a classifier in the user's account. This can be a GrokClassifier
, an XMLClassifier
, a JsonClassifier
, or a CsvClassifier
, depending on which field of the request is present.
CreateConnection
- ConnectionInput => Paws::Glue::ConnectionInput
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::CreateConnection
Returns: a Paws::Glue::CreateConnectionResponse instance
Creates a connection definition in the Data Catalog.
CreateCrawler
- Name => Str
- Role => Str
- Targets => Paws::Glue::CrawlerTargets
- [Classifiers => ArrayRef[Str|Undef]]
- [Configuration => Str]
- [CrawlerSecurityConfiguration => Str]
- [DatabaseName => Str]
- [Description => Str]
- [LineageConfiguration => Paws::Glue::LineageConfiguration]
- [RecrawlPolicy => Paws::Glue::RecrawlPolicy]
- [Schedule => Str]
- [SchemaChangePolicy => Paws::Glue::SchemaChangePolicy]
- [TablePrefix => Str]
- [Tags => Paws::Glue::TagsMap]
Each argument is described in detail in: Paws::Glue::CreateCrawler
Returns: a Paws::Glue::CreateCrawlerResponse instance
Creates a new crawler with specified targets, role, configuration, and optional schedule. At least one crawl target must be specified, in the s3Targets
field, the jdbcTargets
field, or the DynamoDBTargets
field.
CreateDatabase
- DatabaseInput => Paws::Glue::DatabaseInput
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::CreateDatabase
Returns: a Paws::Glue::CreateDatabaseResponse instance
Creates a new database in a Data Catalog.
CreateDevEndpoint
- EndpointName => Str
- RoleArn => Str
- [Arguments => Paws::Glue::MapValue]
- [ExtraJarsS3Path => Str]
- [ExtraPythonLibsS3Path => Str]
- [GlueVersion => Str]
- [NumberOfNodes => Int]
- [NumberOfWorkers => Int]
- [PublicKey => Str]
- [PublicKeys => ArrayRef[Str|Undef]]
- [SecurityConfiguration => Str]
- [SecurityGroupIds => ArrayRef[Str|Undef]]
- [SubnetId => Str]
- [Tags => Paws::Glue::TagsMap]
- [WorkerType => Str]
Each argument is described in detail in: Paws::Glue::CreateDevEndpoint
Returns: a Paws::Glue::CreateDevEndpointResponse instance
Creates a new development endpoint.
CreateJob
- Command => Paws::Glue::JobCommand
- Name => Str
- Role => Str
- [AllocatedCapacity => Int]
- [Connections => Paws::Glue::ConnectionsList]
- [DefaultArguments => Paws::Glue::GenericMap]
- [Description => Str]
- [ExecutionProperty => Paws::Glue::ExecutionProperty]
- [GlueVersion => Str]
- [LogUri => Str]
- [MaxCapacity => Num]
- [MaxRetries => Int]
- [NonOverridableArguments => Paws::Glue::GenericMap]
- [NotificationProperty => Paws::Glue::NotificationProperty]
- [NumberOfWorkers => Int]
- [SecurityConfiguration => Str]
- [Tags => Paws::Glue::TagsMap]
- [Timeout => Int]
- [WorkerType => Str]
Each argument is described in detail in: Paws::Glue::CreateJob
Returns: a Paws::Glue::CreateJobResponse instance
Creates a new job definition.
CreateMLTransform
- InputRecordTables => ArrayRef[Paws::Glue::GlueTable]
- Name => Str
- Parameters => Paws::Glue::TransformParameters
- Role => Str
- [Description => Str]
- [GlueVersion => Str]
- [MaxCapacity => Num]
- [MaxRetries => Int]
- [NumberOfWorkers => Int]
- [Tags => Paws::Glue::TagsMap]
- [Timeout => Int]
- [TransformEncryption => Paws::Glue::TransformEncryption]
- [WorkerType => Str]
Each argument is described in detail in: Paws::Glue::CreateMLTransform
Returns: a Paws::Glue::CreateMLTransformResponse instance
Creates an Glue machine learning transform. This operation creates the transform and all the necessary parameters to train it.
Call this operation as the first step in the process of using a machine learning transform (such as the FindMatches
transform) for deduplicating data. You can provide an optional Description
, in addition to the parameters that you want to use for your algorithm.
You must also specify certain parameters for the tasks that Glue runs on your behalf as part of learning from your data and creating a high-quality machine learning transform. These parameters include Role
, and optionally, AllocatedCapacity
, Timeout
, and MaxRetries
. For more information, see Jobs (https://docs.aws.amazon.com/glue/latest/dg/aws-glue-api-jobs-job.html).
CreatePartition
- DatabaseName => Str
- PartitionInput => Paws::Glue::PartitionInput
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::CreatePartition
Returns: a Paws::Glue::CreatePartitionResponse instance
Creates a new partition.
CreatePartitionIndex
- DatabaseName => Str
- PartitionIndex => Paws::Glue::PartitionIndex
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::CreatePartitionIndex
Returns: a Paws::Glue::CreatePartitionIndexResponse instance
Creates a specified partition index in an existing table.
CreateRegistry
- RegistryName => Str
- [Description => Str]
- [Tags => Paws::Glue::TagsMap]
Each argument is described in detail in: Paws::Glue::CreateRegistry
Returns: a Paws::Glue::CreateRegistryResponse instance
Creates a new registry which may be used to hold a collection of schemas.
CreateSchema
- DataFormat => Str
- SchemaName => Str
- [Compatibility => Str]
- [Description => Str]
- [RegistryId => Paws::Glue::RegistryId]
- [SchemaDefinition => Str]
- [Tags => Paws::Glue::TagsMap]
Each argument is described in detail in: Paws::Glue::CreateSchema
Returns: a Paws::Glue::CreateSchemaResponse instance
Creates a new schema set and registers the schema definition. Returns an error if the schema set already exists without actually registering the version.
When the schema set is created, a version checkpoint will be set to the first version. Compatibility mode "DISABLED" restricts any additional schema versions from being added after the first schema version. For all other compatibility modes, validation of compatibility settings will be applied only from the second version onwards when the RegisterSchemaVersion
API is used.
When this API is called without a RegistryId
, this will create an entry for a "default-registry" in the registry database tables, if it is not already present.
CreateScript
- [DagEdges => ArrayRef[Paws::Glue::CodeGenEdge]]
- [DagNodes => ArrayRef[Paws::Glue::CodeGenNode]]
- [Language => Str]
Each argument is described in detail in: Paws::Glue::CreateScript
Returns: a Paws::Glue::CreateScriptResponse instance
Transforms a directed acyclic graph (DAG) into code.
CreateSecurityConfiguration
- EncryptionConfiguration => Paws::Glue::EncryptionConfiguration
- Name => Str
Each argument is described in detail in: Paws::Glue::CreateSecurityConfiguration
Returns: a Paws::Glue::CreateSecurityConfigurationResponse instance
Creates a new security configuration. A security configuration is a set of security properties that can be used by Glue. You can use a security configuration to encrypt data at rest. For information about using security configurations in Glue, see Encrypting Data Written by Crawlers, Jobs, and Development Endpoints (https://docs.aws.amazon.com/glue/latest/dg/encryption-security-configuration.html).
CreateTable
- DatabaseName => Str
- TableInput => Paws::Glue::TableInput
- [CatalogId => Str]
- [PartitionIndexes => ArrayRef[Paws::Glue::PartitionIndex]]
Each argument is described in detail in: Paws::Glue::CreateTable
Returns: a Paws::Glue::CreateTableResponse instance
Creates a new table definition in the Data Catalog.
CreateTrigger
- Actions => ArrayRef[Paws::Glue::Action]
- Name => Str
- Type => Str
- [Description => Str]
- [Predicate => Paws::Glue::Predicate]
- [Schedule => Str]
- [StartOnCreation => Bool]
- [Tags => Paws::Glue::TagsMap]
- [WorkflowName => Str]
Each argument is described in detail in: Paws::Glue::CreateTrigger
Returns: a Paws::Glue::CreateTriggerResponse instance
Creates a new trigger.
CreateUserDefinedFunction
- DatabaseName => Str
- FunctionInput => Paws::Glue::UserDefinedFunctionInput
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::CreateUserDefinedFunction
Returns: a Paws::Glue::CreateUserDefinedFunctionResponse instance
Creates a new function definition in the Data Catalog.
CreateWorkflow
- Name => Str
- [DefaultRunProperties => Paws::Glue::WorkflowRunProperties]
- [Description => Str]
- [MaxConcurrentRuns => Int]
- [Tags => Paws::Glue::TagsMap]
Each argument is described in detail in: Paws::Glue::CreateWorkflow
Returns: a Paws::Glue::CreateWorkflowResponse instance
Creates a new workflow.
DeleteClassifier
Each argument is described in detail in: Paws::Glue::DeleteClassifier
Returns: a Paws::Glue::DeleteClassifierResponse instance
Removes a classifier from the Data Catalog.
DeleteColumnStatisticsForPartition
- ColumnName => Str
- DatabaseName => Str
- PartitionValues => ArrayRef[Str|Undef]
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::DeleteColumnStatisticsForPartition
Returns: a Paws::Glue::DeleteColumnStatisticsForPartitionResponse instance
Delete the partition column statistics of a column.
The Identity and Access Management (IAM) permission required for this operation is DeletePartition
.
DeleteColumnStatisticsForTable
Each argument is described in detail in: Paws::Glue::DeleteColumnStatisticsForTable
Returns: a Paws::Glue::DeleteColumnStatisticsForTableResponse instance
Retrieves table statistics of columns.
The Identity and Access Management (IAM) permission required for this operation is DeleteTable
.
DeleteConnection
Each argument is described in detail in: Paws::Glue::DeleteConnection
Returns: a Paws::Glue::DeleteConnectionResponse instance
Deletes a connection from the Data Catalog.
DeleteCrawler
Each argument is described in detail in: Paws::Glue::DeleteCrawler
Returns: a Paws::Glue::DeleteCrawlerResponse instance
Removes a specified crawler from the Glue Data Catalog, unless the crawler state is RUNNING
.
DeleteDatabase
Each argument is described in detail in: Paws::Glue::DeleteDatabase
Returns: a Paws::Glue::DeleteDatabaseResponse instance
Removes a specified database from a Data Catalog.
After completing this operation, you no longer have access to the tables (and all table versions and partitions that might belong to the tables) and the user-defined functions in the deleted database. Glue deletes these "orphaned" resources asynchronously in a timely manner, at the discretion of the service.
To ensure the immediate deletion of all related resources, before calling DeleteDatabase
, use DeleteTableVersion
or BatchDeleteTableVersion
, DeletePartition
or BatchDeletePartition
, DeleteUserDefinedFunction
, and DeleteTable
or BatchDeleteTable
, to delete any resources that belong to the database.
DeleteDevEndpoint
Each argument is described in detail in: Paws::Glue::DeleteDevEndpoint
Returns: a Paws::Glue::DeleteDevEndpointResponse instance
Deletes a specified development endpoint.
DeleteJob
Each argument is described in detail in: Paws::Glue::DeleteJob
Returns: a Paws::Glue::DeleteJobResponse instance
Deletes a specified job definition. If the job definition is not found, no exception is thrown.
DeleteMLTransform
Each argument is described in detail in: Paws::Glue::DeleteMLTransform
Returns: a Paws::Glue::DeleteMLTransformResponse instance
Deletes an Glue machine learning transform. Machine learning transforms are a special type of transform that use machine learning to learn the details of the transformation to be performed by learning from examples provided by humans. These transformations are then saved by Glue. If you no longer need a transform, you can delete it by calling DeleteMLTransforms
. However, any Glue jobs that still reference the deleted transform will no longer succeed.
DeletePartition
Each argument is described in detail in: Paws::Glue::DeletePartition
Returns: a Paws::Glue::DeletePartitionResponse instance
Deletes a specified partition.
DeletePartitionIndex
Each argument is described in detail in: Paws::Glue::DeletePartitionIndex
Returns: a Paws::Glue::DeletePartitionIndexResponse instance
Deletes a specified partition index from an existing table.
DeleteRegistry
- RegistryId => Paws::Glue::RegistryId
Each argument is described in detail in: Paws::Glue::DeleteRegistry
Returns: a Paws::Glue::DeleteRegistryResponse instance
Delete the entire registry including schema and all of its versions. To get the status of the delete operation, you can call the GetRegistry
API after the asynchronous call. Deleting a registry will deactivate all online operations for the registry such as the UpdateRegistry
, CreateSchema
, UpdateSchema
, and RegisterSchemaVersion
APIs.
DeleteResourcePolicy
Each argument is described in detail in: Paws::Glue::DeleteResourcePolicy
Returns: a Paws::Glue::DeleteResourcePolicyResponse instance
Deletes a specified policy.
DeleteSchema
- SchemaId => Paws::Glue::SchemaId
Each argument is described in detail in: Paws::Glue::DeleteSchema
Returns: a Paws::Glue::DeleteSchemaResponse instance
Deletes the entire schema set, including the schema set and all of its versions. To get the status of the delete operation, you can call GetSchema
API after the asynchronous call. Deleting a registry will deactivate all online operations for the schema, such as the GetSchemaByDefinition
, and RegisterSchemaVersion
APIs.
DeleteSchemaVersions
- SchemaId => Paws::Glue::SchemaId
- Versions => Str
Each argument is described in detail in: Paws::Glue::DeleteSchemaVersions
Returns: a Paws::Glue::DeleteSchemaVersionsResponse instance
Remove versions from the specified schema. A version number or range may be supplied. If the compatibility mode forbids deleting of a version that is necessary, such as BACKWARDS_FULL, an error is returned. Calling the GetSchemaVersions
API after this call will list the status of the deleted versions.
When the range of version numbers contain check pointed version, the API will return a 409 conflict and will not proceed with the deletion. You have to remove the checkpoint first using the DeleteSchemaCheckpoint
API before using this API.
You cannot use the DeleteSchemaVersions
API to delete the first schema version in the schema set. The first schema version can only be deleted by the DeleteSchema
API. This operation will also delete the attached SchemaVersionMetadata
under the schema versions. Hard deletes will be enforced on the database.
If the compatibility mode forbids deleting of a version that is necessary, such as BACKWARDS_FULL, an error is returned.
DeleteSecurityConfiguration
Each argument is described in detail in: Paws::Glue::DeleteSecurityConfiguration
Returns: a Paws::Glue::DeleteSecurityConfigurationResponse instance
Deletes a specified security configuration.
DeleteTable
Each argument is described in detail in: Paws::Glue::DeleteTable
Returns: a Paws::Glue::DeleteTableResponse instance
Removes a table definition from the Data Catalog.
After completing this operation, you no longer have access to the table versions and partitions that belong to the deleted table. Glue deletes these "orphaned" resources asynchronously in a timely manner, at the discretion of the service.
To ensure the immediate deletion of all related resources, before calling DeleteTable
, use DeleteTableVersion
or BatchDeleteTableVersion
, and DeletePartition
or BatchDeletePartition
, to delete any resources that belong to the table.
DeleteTableVersion
Each argument is described in detail in: Paws::Glue::DeleteTableVersion
Returns: a Paws::Glue::DeleteTableVersionResponse instance
Deletes a specified version of a table.
DeleteTrigger
Each argument is described in detail in: Paws::Glue::DeleteTrigger
Returns: a Paws::Glue::DeleteTriggerResponse instance
Deletes a specified trigger. If the trigger is not found, no exception is thrown.
DeleteUserDefinedFunction
Each argument is described in detail in: Paws::Glue::DeleteUserDefinedFunction
Returns: a Paws::Glue::DeleteUserDefinedFunctionResponse instance
Deletes an existing function definition from the Data Catalog.
DeleteWorkflow
Each argument is described in detail in: Paws::Glue::DeleteWorkflow
Returns: a Paws::Glue::DeleteWorkflowResponse instance
Deletes a workflow.
GetCatalogImportStatus
Each argument is described in detail in: Paws::Glue::GetCatalogImportStatus
Returns: a Paws::Glue::GetCatalogImportStatusResponse instance
Retrieves the status of a migration operation.
GetClassifier
Each argument is described in detail in: Paws::Glue::GetClassifier
Returns: a Paws::Glue::GetClassifierResponse instance
Retrieve a classifier by name.
GetClassifiers
Each argument is described in detail in: Paws::Glue::GetClassifiers
Returns: a Paws::Glue::GetClassifiersResponse instance
Lists all classifier objects in the Data Catalog.
GetColumnStatisticsForPartition
- ColumnNames => ArrayRef[Str|Undef]
- DatabaseName => Str
- PartitionValues => ArrayRef[Str|Undef]
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::GetColumnStatisticsForPartition
Returns: a Paws::Glue::GetColumnStatisticsForPartitionResponse instance
Retrieves partition statistics of columns.
The Identity and Access Management (IAM) permission required for this operation is GetPartition
.
GetColumnStatisticsForTable
Each argument is described in detail in: Paws::Glue::GetColumnStatisticsForTable
Returns: a Paws::Glue::GetColumnStatisticsForTableResponse instance
Retrieves table statistics of columns.
The Identity and Access Management (IAM) permission required for this operation is GetTable
.
GetConnection
Each argument is described in detail in: Paws::Glue::GetConnection
Returns: a Paws::Glue::GetConnectionResponse instance
Retrieves a connection definition from the Data Catalog.
GetConnections
- [CatalogId => Str]
- [Filter => Paws::Glue::GetConnectionsFilter]
- [HidePassword => Bool]
- [MaxResults => Int]
- [NextToken => Str]
Each argument is described in detail in: Paws::Glue::GetConnections
Returns: a Paws::Glue::GetConnectionsResponse instance
Retrieves a list of connection definitions from the Data Catalog.
GetCrawler
Each argument is described in detail in: Paws::Glue::GetCrawler
Returns: a Paws::Glue::GetCrawlerResponse instance
Retrieves metadata for a specified crawler.
GetCrawlerMetrics
Each argument is described in detail in: Paws::Glue::GetCrawlerMetrics
Returns: a Paws::Glue::GetCrawlerMetricsResponse instance
Retrieves metrics about specified crawlers.
GetCrawlers
Each argument is described in detail in: Paws::Glue::GetCrawlers
Returns: a Paws::Glue::GetCrawlersResponse instance
Retrieves metadata for all crawlers defined in the customer account.
GetDatabase
Each argument is described in detail in: Paws::Glue::GetDatabase
Returns: a Paws::Glue::GetDatabaseResponse instance
Retrieves the definition of a specified database.
GetDatabases
Each argument is described in detail in: Paws::Glue::GetDatabases
Returns: a Paws::Glue::GetDatabasesResponse instance
Retrieves all databases defined in a given Data Catalog.
GetDataCatalogEncryptionSettings
Each argument is described in detail in: Paws::Glue::GetDataCatalogEncryptionSettings
Returns: a Paws::Glue::GetDataCatalogEncryptionSettingsResponse instance
Retrieves the security configuration for a specified catalog.
GetDataflowGraph
Each argument is described in detail in: Paws::Glue::GetDataflowGraph
Returns: a Paws::Glue::GetDataflowGraphResponse instance
Transforms a Python script into a directed acyclic graph (DAG).
GetDevEndpoint
Each argument is described in detail in: Paws::Glue::GetDevEndpoint
Returns: a Paws::Glue::GetDevEndpointResponse instance
Retrieves information about a specified development endpoint.
When you create a development endpoint in a virtual private cloud (VPC), Glue returns only a private IP address, and the public IP address field is not populated. When you create a non-VPC development endpoint, Glue returns only a public IP address.
GetDevEndpoints
Each argument is described in detail in: Paws::Glue::GetDevEndpoints
Returns: a Paws::Glue::GetDevEndpointsResponse instance
Retrieves all the development endpoints in this AWS account.
When you create a development endpoint in a virtual private cloud (VPC), Glue returns only a private IP address and the public IP address field is not populated. When you create a non-VPC development endpoint, Glue returns only a public IP address.
GetJob
Each argument is described in detail in: Paws::Glue::GetJob
Returns: a Paws::Glue::GetJobResponse instance
Retrieves an existing job definition.
GetJobBookmark
Each argument is described in detail in: Paws::Glue::GetJobBookmark
Returns: a Paws::Glue::GetJobBookmarkResponse instance
Returns information on a job bookmark entry.
GetJobRun
Each argument is described in detail in: Paws::Glue::GetJobRun
Returns: a Paws::Glue::GetJobRunResponse instance
Retrieves the metadata for a given job run.
GetJobRuns
Each argument is described in detail in: Paws::Glue::GetJobRuns
Returns: a Paws::Glue::GetJobRunsResponse instance
Retrieves metadata for all runs of a given job definition.
GetJobs
Each argument is described in detail in: Paws::Glue::GetJobs
Returns: a Paws::Glue::GetJobsResponse instance
Retrieves all current job definitions.
GetMapping
- Source => Paws::Glue::CatalogEntry
- [Location => Paws::Glue::Location]
- [Sinks => ArrayRef[Paws::Glue::CatalogEntry]]
Each argument is described in detail in: Paws::Glue::GetMapping
Returns: a Paws::Glue::GetMappingResponse instance
Creates mappings.
GetMLTaskRun
Each argument is described in detail in: Paws::Glue::GetMLTaskRun
Returns: a Paws::Glue::GetMLTaskRunResponse instance
Gets details for a specific task run on a machine learning transform. Machine learning task runs are asynchronous tasks that Glue runs on your behalf as part of various machine learning workflows. You can check the stats of any task run by calling GetMLTaskRun
with the TaskRunID
and its parent transform's TransformID
.
GetMLTaskRuns
- TransformId => Str
- [Filter => Paws::Glue::TaskRunFilterCriteria]
- [MaxResults => Int]
- [NextToken => Str]
- [Sort => Paws::Glue::TaskRunSortCriteria]
Each argument is described in detail in: Paws::Glue::GetMLTaskRuns
Returns: a Paws::Glue::GetMLTaskRunsResponse instance
Gets a list of runs for a machine learning transform. Machine learning task runs are asynchronous tasks that Glue runs on your behalf as part of various machine learning workflows. You can get a sortable, filterable list of machine learning task runs by calling GetMLTaskRuns
with their parent transform's TransformID
and other optional parameters as documented in this section.
This operation returns a list of historic runs and must be paginated.
GetMLTransform
Each argument is described in detail in: Paws::Glue::GetMLTransform
Returns: a Paws::Glue::GetMLTransformResponse instance
Gets an Glue machine learning transform artifact and all its corresponding metadata. Machine learning transforms are a special type of transform that use machine learning to learn the details of the transformation to be performed by learning from examples provided by humans. These transformations are then saved by Glue. You can retrieve their metadata by calling GetMLTransform
.
GetMLTransforms
- [Filter => Paws::Glue::TransformFilterCriteria]
- [MaxResults => Int]
- [NextToken => Str]
- [Sort => Paws::Glue::TransformSortCriteria]
Each argument is described in detail in: Paws::Glue::GetMLTransforms
Returns: a Paws::Glue::GetMLTransformsResponse instance
Gets a sortable, filterable list of existing Glue machine learning transforms. Machine learning transforms are a special type of transform that use machine learning to learn the details of the transformation to be performed by learning from examples provided by humans. These transformations are then saved by Glue, and you can retrieve their metadata by calling GetMLTransforms
.
GetPartition
Each argument is described in detail in: Paws::Glue::GetPartition
Returns: a Paws::Glue::GetPartitionResponse instance
Retrieves information about a specified partition.
GetPartitionIndexes
Each argument is described in detail in: Paws::Glue::GetPartitionIndexes
Returns: a Paws::Glue::GetPartitionIndexesResponse instance
Retrieves the partition indexes associated with a table.
GetPartitions
- DatabaseName => Str
- TableName => Str
- [CatalogId => Str]
- [ExcludeColumnSchema => Bool]
- [Expression => Str]
- [MaxResults => Int]
- [NextToken => Str]
- [Segment => Paws::Glue::Segment]
Each argument is described in detail in: Paws::Glue::GetPartitions
Returns: a Paws::Glue::GetPartitionsResponse instance
Retrieves information about the partitions in a table.
GetPlan
- Mapping => ArrayRef[Paws::Glue::MappingEntry]
- Source => Paws::Glue::CatalogEntry
- [AdditionalPlanOptionsMap => Paws::Glue::AdditionalPlanOptionsMap]
- [Language => Str]
- [Location => Paws::Glue::Location]
- [Sinks => ArrayRef[Paws::Glue::CatalogEntry]]
Each argument is described in detail in: Paws::Glue::GetPlan
Returns: a Paws::Glue::GetPlanResponse instance
Gets code to perform a specified mapping.
GetRegistry
- RegistryId => Paws::Glue::RegistryId
Each argument is described in detail in: Paws::Glue::GetRegistry
Returns: a Paws::Glue::GetRegistryResponse instance
Describes the specified registry in detail.
GetResourcePolicies
Each argument is described in detail in: Paws::Glue::GetResourcePolicies
Returns: a Paws::Glue::GetResourcePoliciesResponse instance
Retrieves the resource policies set on individual resources by Resource Access Manager during cross-account permission grants. Also retrieves the Data Catalog resource policy.
If you enabled metadata encryption in Data Catalog settings, and you do not have permission on the KMS key, the operation can't return the Data Catalog resource policy.
GetResourcePolicy
Each argument is described in detail in: Paws::Glue::GetResourcePolicy
Returns: a Paws::Glue::GetResourcePolicyResponse instance
Retrieves a specified resource policy.
GetSchema
- SchemaId => Paws::Glue::SchemaId
Each argument is described in detail in: Paws::Glue::GetSchema
Returns: a Paws::Glue::GetSchemaResponse instance
Describes the specified schema in detail.
GetSchemaByDefinition
- SchemaDefinition => Str
- SchemaId => Paws::Glue::SchemaId
Each argument is described in detail in: Paws::Glue::GetSchemaByDefinition
Returns: a Paws::Glue::GetSchemaByDefinitionResponse instance
Retrieves a schema by the SchemaDefinition
. The schema definition is sent to the Schema Registry, canonicalized, and hashed. If the hash is matched within the scope of the SchemaName
or ARN (or the default registry, if none is supplied), that schema’s metadata is returned. Otherwise, a 404 or NotFound error is returned. Schema versions in Deleted
statuses will not be included in the results.
GetSchemaVersion
- [SchemaId => Paws::Glue::SchemaId]
- [SchemaVersionId => Str]
- [SchemaVersionNumber => Paws::Glue::SchemaVersionNumber]
Each argument is described in detail in: Paws::Glue::GetSchemaVersion
Returns: a Paws::Glue::GetSchemaVersionResponse instance
Get the specified schema by its unique ID assigned when a version of the schema is created or registered. Schema versions in Deleted status will not be included in the results.
GetSchemaVersionsDiff
- FirstSchemaVersionNumber => Paws::Glue::SchemaVersionNumber
- SchemaDiffType => Str
- SchemaId => Paws::Glue::SchemaId
- SecondSchemaVersionNumber => Paws::Glue::SchemaVersionNumber
Each argument is described in detail in: Paws::Glue::GetSchemaVersionsDiff
Returns: a Paws::Glue::GetSchemaVersionsDiffResponse instance
Fetches the schema version difference in the specified difference type between two stored schema versions in the Schema Registry.
This API allows you to compare two schema versions between two schema definitions under the same schema.
GetSecurityConfiguration
Each argument is described in detail in: Paws::Glue::GetSecurityConfiguration
Returns: a Paws::Glue::GetSecurityConfigurationResponse instance
Retrieves a specified security configuration.
GetSecurityConfigurations
Each argument is described in detail in: Paws::Glue::GetSecurityConfigurations
Returns: a Paws::Glue::GetSecurityConfigurationsResponse instance
Retrieves a list of all security configurations.
GetTable
Each argument is described in detail in: Paws::Glue::GetTable
Returns: a Paws::Glue::GetTableResponse instance
Retrieves the Table
definition in a Data Catalog for a specified table.
GetTables
Each argument is described in detail in: Paws::Glue::GetTables
Returns: a Paws::Glue::GetTablesResponse instance
Retrieves the definitions of some or all of the tables in a given Database
.
GetTableVersion
Each argument is described in detail in: Paws::Glue::GetTableVersion
Returns: a Paws::Glue::GetTableVersionResponse instance
Retrieves a specified version of a table.
GetTableVersions
Each argument is described in detail in: Paws::Glue::GetTableVersions
Returns: a Paws::Glue::GetTableVersionsResponse instance
Retrieves a list of strings that identify available versions of a specified table.
GetTags
Each argument is described in detail in: Paws::Glue::GetTags
Returns: a Paws::Glue::GetTagsResponse instance
Retrieves a list of tags associated with a resource.
GetTrigger
Each argument is described in detail in: Paws::Glue::GetTrigger
Returns: a Paws::Glue::GetTriggerResponse instance
Retrieves the definition of a trigger.
GetTriggers
Each argument is described in detail in: Paws::Glue::GetTriggers
Returns: a Paws::Glue::GetTriggersResponse instance
Gets all the triggers associated with a job.
GetUserDefinedFunction
Each argument is described in detail in: Paws::Glue::GetUserDefinedFunction
Returns: a Paws::Glue::GetUserDefinedFunctionResponse instance
Retrieves a specified function definition from the Data Catalog.
GetUserDefinedFunctions
Each argument is described in detail in: Paws::Glue::GetUserDefinedFunctions
Returns: a Paws::Glue::GetUserDefinedFunctionsResponse instance
Retrieves multiple function definitions from the Data Catalog.
GetWorkflow
Each argument is described in detail in: Paws::Glue::GetWorkflow
Returns: a Paws::Glue::GetWorkflowResponse instance
Retrieves resource metadata for a workflow.
GetWorkflowRun
Each argument is described in detail in: Paws::Glue::GetWorkflowRun
Returns: a Paws::Glue::GetWorkflowRunResponse instance
Retrieves the metadata for a given workflow run.
GetWorkflowRunProperties
Each argument is described in detail in: Paws::Glue::GetWorkflowRunProperties
Returns: a Paws::Glue::GetWorkflowRunPropertiesResponse instance
Retrieves the workflow run properties which were set during the run.
GetWorkflowRuns
Each argument is described in detail in: Paws::Glue::GetWorkflowRuns
Returns: a Paws::Glue::GetWorkflowRunsResponse instance
Retrieves metadata for all runs of a given workflow.
ImportCatalogToGlue
Each argument is described in detail in: Paws::Glue::ImportCatalogToGlue
Returns: a Paws::Glue::ImportCatalogToGlueResponse instance
Imports an existing Amazon Athena Data Catalog to Glue.
ListCrawlers
- [MaxResults => Int]
- [NextToken => Str]
- [Tags => Paws::Glue::TagsMap]
Each argument is described in detail in: Paws::Glue::ListCrawlers
Returns: a Paws::Glue::ListCrawlersResponse instance
Retrieves the names of all crawler resources in this Amazon Web Services account, or the resources with the specified tag. This operation allows you to see which resources are available in your account, and their names.
This operation takes the optional Tags
field, which you can use as a filter on the response so that tagged resources can be retrieved as a group. If you choose to use tags filtering, only resources with the tag are retrieved.
ListDevEndpoints
- [MaxResults => Int]
- [NextToken => Str]
- [Tags => Paws::Glue::TagsMap]
Each argument is described in detail in: Paws::Glue::ListDevEndpoints
Returns: a Paws::Glue::ListDevEndpointsResponse instance
Retrieves the names of all DevEndpoint
resources in this Amazon Web Services account, or the resources with the specified tag. This operation allows you to see which resources are available in your account, and their names.
This operation takes the optional Tags
field, which you can use as a filter on the response so that tagged resources can be retrieved as a group. If you choose to use tags filtering, only resources with the tag are retrieved.
ListJobs
- [MaxResults => Int]
- [NextToken => Str]
- [Tags => Paws::Glue::TagsMap]
Each argument is described in detail in: Paws::Glue::ListJobs
Returns: a Paws::Glue::ListJobsResponse instance
Retrieves the names of all job resources in this Amazon Web Services account, or the resources with the specified tag. This operation allows you to see which resources are available in your account, and their names.
This operation takes the optional Tags
field, which you can use as a filter on the response so that tagged resources can be retrieved as a group. If you choose to use tags filtering, only resources with the tag are retrieved.
ListMLTransforms
- [Filter => Paws::Glue::TransformFilterCriteria]
- [MaxResults => Int]
- [NextToken => Str]
- [Sort => Paws::Glue::TransformSortCriteria]
- [Tags => Paws::Glue::TagsMap]
Each argument is described in detail in: Paws::Glue::ListMLTransforms
Returns: a Paws::Glue::ListMLTransformsResponse instance
Retrieves a sortable, filterable list of existing Glue machine learning transforms in this Amazon Web Services account, or the resources with the specified tag. This operation takes the optional Tags
field, which you can use as a filter of the responses so that tagged resources can be retrieved as a group. If you choose to use tag filtering, only resources with the tags are retrieved.
ListRegistries
Each argument is described in detail in: Paws::Glue::ListRegistries
Returns: a Paws::Glue::ListRegistriesResponse instance
Returns a list of registries that you have created, with minimal registry information. Registries in the Deleting
status will not be included in the results. Empty results will be returned if there are no registries available.
ListSchemas
- [MaxResults => Int]
- [NextToken => Str]
- [RegistryId => Paws::Glue::RegistryId]
Each argument is described in detail in: Paws::Glue::ListSchemas
Returns: a Paws::Glue::ListSchemasResponse instance
Returns a list of schemas with minimal details. Schemas in Deleting status will not be included in the results. Empty results will be returned if there are no schemas available.
When the RegistryId
is not provided, all the schemas across registries will be part of the API response.
ListSchemaVersions
- SchemaId => Paws::Glue::SchemaId
- [MaxResults => Int]
- [NextToken => Str]
Each argument is described in detail in: Paws::Glue::ListSchemaVersions
Returns: a Paws::Glue::ListSchemaVersionsResponse instance
Returns a list of schema versions that you have created, with minimal information. Schema versions in Deleted status will not be included in the results. Empty results will be returned if there are no schema versions available.
ListTriggers
- [DependentJobName => Str]
- [MaxResults => Int]
- [NextToken => Str]
- [Tags => Paws::Glue::TagsMap]
Each argument is described in detail in: Paws::Glue::ListTriggers
Returns: a Paws::Glue::ListTriggersResponse instance
Retrieves the names of all trigger resources in this Amazon Web Services account, or the resources with the specified tag. This operation allows you to see which resources are available in your account, and their names.
This operation takes the optional Tags
field, which you can use as a filter on the response so that tagged resources can be retrieved as a group. If you choose to use tags filtering, only resources with the tag are retrieved.
ListWorkflows
Each argument is described in detail in: Paws::Glue::ListWorkflows
Returns: a Paws::Glue::ListWorkflowsResponse instance
Lists names of workflows created in the account.
PutDataCatalogEncryptionSettings
- DataCatalogEncryptionSettings => Paws::Glue::DataCatalogEncryptionSettings
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::PutDataCatalogEncryptionSettings
Returns: a Paws::Glue::PutDataCatalogEncryptionSettingsResponse instance
Sets the security configuration for a specified catalog. After the configuration has been set, the specified encryption is applied to every catalog write thereafter.
PutResourcePolicy
- PolicyInJson => Str
- [EnableHybrid => Str]
- [PolicyExistsCondition => Str]
- [PolicyHashCondition => Str]
- [ResourceArn => Str]
Each argument is described in detail in: Paws::Glue::PutResourcePolicy
Returns: a Paws::Glue::PutResourcePolicyResponse instance
Sets the Data Catalog resource policy for access control.
PutSchemaVersionMetadata
- MetadataKeyValue => Paws::Glue::MetadataKeyValuePair
- [SchemaId => Paws::Glue::SchemaId]
- [SchemaVersionId => Str]
- [SchemaVersionNumber => Paws::Glue::SchemaVersionNumber]
Each argument is described in detail in: Paws::Glue::PutSchemaVersionMetadata
Returns: a Paws::Glue::PutSchemaVersionMetadataResponse instance
Puts the metadata key value pair for a specified schema version ID. A maximum of 10 key value pairs will be allowed per schema version. They can be added over one or more calls.
PutWorkflowRunProperties
- Name => Str
- RunId => Str
- RunProperties => Paws::Glue::WorkflowRunProperties
Each argument is described in detail in: Paws::Glue::PutWorkflowRunProperties
Returns: a Paws::Glue::PutWorkflowRunPropertiesResponse instance
Puts the specified workflow run properties for the given workflow run. If a property already exists for the specified run, then it overrides the value otherwise adds the property to existing properties.
QuerySchemaVersionMetadata
- [MaxResults => Int]
- [MetadataList => ArrayRef[Paws::Glue::MetadataKeyValuePair]]
- [NextToken => Str]
- [SchemaId => Paws::Glue::SchemaId]
- [SchemaVersionId => Str]
- [SchemaVersionNumber => Paws::Glue::SchemaVersionNumber]
Each argument is described in detail in: Paws::Glue::QuerySchemaVersionMetadata
Returns: a Paws::Glue::QuerySchemaVersionMetadataResponse instance
Queries for the schema version metadata information.
RegisterSchemaVersion
- SchemaDefinition => Str
- SchemaId => Paws::Glue::SchemaId
Each argument is described in detail in: Paws::Glue::RegisterSchemaVersion
Returns: a Paws::Glue::RegisterSchemaVersionResponse instance
Adds a new version to the existing schema. Returns an error if new version of schema does not meet the compatibility requirements of the schema set. This API will not create a new schema set and will return a 404 error if the schema set is not already present in the Schema Registry.
If this is the first schema definition to be registered in the Schema Registry, this API will store the schema version and return immediately. Otherwise, this call has the potential to run longer than other operations due to compatibility modes. You can call the GetSchemaVersion
API with the SchemaVersionId
to check compatibility modes.
If the same schema definition is already stored in Schema Registry as a version, the schema ID of the existing schema is returned to the caller.
RemoveSchemaVersionMetadata
- MetadataKeyValue => Paws::Glue::MetadataKeyValuePair
- [SchemaId => Paws::Glue::SchemaId]
- [SchemaVersionId => Str]
- [SchemaVersionNumber => Paws::Glue::SchemaVersionNumber]
Each argument is described in detail in: Paws::Glue::RemoveSchemaVersionMetadata
Returns: a Paws::Glue::RemoveSchemaVersionMetadataResponse instance
Removes a key value pair from the schema version metadata for the specified schema version ID.
ResetJobBookmark
Each argument is described in detail in: Paws::Glue::ResetJobBookmark
Returns: a Paws::Glue::ResetJobBookmarkResponse instance
Resets a bookmark entry.
ResumeWorkflowRun
Each argument is described in detail in: Paws::Glue::ResumeWorkflowRun
Returns: a Paws::Glue::ResumeWorkflowRunResponse instance
Restarts selected nodes of a previous partially completed workflow run and resumes the workflow run. The selected nodes and all nodes that are downstream from the selected nodes are run.
SearchTables
- [CatalogId => Str]
- [Filters => ArrayRef[Paws::Glue::PropertyPredicate]]
- [MaxResults => Int]
- [NextToken => Str]
- [SearchText => Str]
- [SortCriteria => ArrayRef[Paws::Glue::SortCriterion]]
Each argument is described in detail in: Paws::Glue::SearchTables
Returns: a Paws::Glue::SearchTablesResponse instance
Searches a set of tables based on properties in the table metadata as well as on the parent database. You can search against text or filter conditions.
You can only get tables that you have access to based on the security policies defined in Lake Formation. You need at least a read-only access to the table for it to be returned. If you do not have access to all the columns in the table, these columns will not be searched against when returning the list of tables back to you. If you have access to the columns but not the data in the columns, those columns and the associated metadata for those columns will be included in the search.
StartCrawler
Each argument is described in detail in: Paws::Glue::StartCrawler
Returns: a Paws::Glue::StartCrawlerResponse instance
Starts a crawl using the specified crawler, regardless of what is scheduled. If the crawler is already running, returns a CrawlerRunningException (https://docs.aws.amazon.com/glue/latest/dg/aws-glue-api-exceptions.html#aws-glue-api-exceptions-CrawlerRunningException).
StartCrawlerSchedule
Each argument is described in detail in: Paws::Glue::StartCrawlerSchedule
Returns: a Paws::Glue::StartCrawlerScheduleResponse instance
Changes the schedule state of the specified crawler to SCHEDULED
, unless the crawler is already running or the schedule state is already SCHEDULED
.
StartExportLabelsTaskRun
Each argument is described in detail in: Paws::Glue::StartExportLabelsTaskRun
Returns: a Paws::Glue::StartExportLabelsTaskRunResponse instance
Begins an asynchronous task to export all labeled data for a particular transform. This task is the only label-related API call that is not part of the typical active learning workflow. You typically use StartExportLabelsTaskRun
when you want to work with all of your existing labels at the same time, such as when you want to remove or change labels that were previously submitted as truth. This API operation accepts the TransformId
whose labels you want to export and an Amazon Simple Storage Service (Amazon S3) path to export the labels to. The operation returns a TaskRunId
. You can check on the status of your task run by calling the GetMLTaskRun
API.
StartImportLabelsTaskRun
Each argument is described in detail in: Paws::Glue::StartImportLabelsTaskRun
Returns: a Paws::Glue::StartImportLabelsTaskRunResponse instance
Enables you to provide additional labels (examples of truth) to be used to teach the machine learning transform and improve its quality. This API operation is generally used as part of the active learning workflow that starts with the StartMLLabelingSetGenerationTaskRun
call and that ultimately results in improving the quality of your machine learning transform.
After the StartMLLabelingSetGenerationTaskRun
finishes, Glue machine learning will have generated a series of questions for humans to answer. (Answering these questions is often called 'labeling' in the machine learning workflows). In the case of the FindMatches
transform, these questions are of the form, “What is the correct way to group these rows together into groups composed entirely of matching records?” After the labeling process is finished, users upload their answers/labels with a call to StartImportLabelsTaskRun
. After StartImportLabelsTaskRun
finishes, all future runs of the machine learning transform use the new and improved labels and perform a higher-quality transformation.
By default, StartMLLabelingSetGenerationTaskRun
continually learns from and combines all labels that you upload unless you set Replace
to true. If you set Replace
to true, StartImportLabelsTaskRun
deletes and forgets all previously uploaded labels and learns only from the exact set that you upload. Replacing labels can be helpful if you realize that you previously uploaded incorrect labels, and you believe that they are having a negative effect on your transform quality.
You can check on the status of your task run by calling the GetMLTaskRun
operation.
StartJobRun
- JobName => Str
- [AllocatedCapacity => Int]
- [Arguments => Paws::Glue::GenericMap]
- [JobRunId => Str]
- [MaxCapacity => Num]
- [NotificationProperty => Paws::Glue::NotificationProperty]
- [NumberOfWorkers => Int]
- [SecurityConfiguration => Str]
- [Timeout => Int]
- [WorkerType => Str]
Each argument is described in detail in: Paws::Glue::StartJobRun
Returns: a Paws::Glue::StartJobRunResponse instance
Starts a job run using a job definition.
StartMLEvaluationTaskRun
Each argument is described in detail in: Paws::Glue::StartMLEvaluationTaskRun
Returns: a Paws::Glue::StartMLEvaluationTaskRunResponse instance
Starts a task to estimate the quality of the transform.
When you provide label sets as examples of truth, Glue machine learning uses some of those examples to learn from them. The rest of the labels are used as a test to estimate quality.
Returns a unique identifier for the run. You can call GetMLTaskRun
to get more information about the stats of the EvaluationTaskRun
.
StartMLLabelingSetGenerationTaskRun
Each argument is described in detail in: Paws::Glue::StartMLLabelingSetGenerationTaskRun
Returns: a Paws::Glue::StartMLLabelingSetGenerationTaskRunResponse instance
Starts the active learning workflow for your machine learning transform to improve the transform's quality by generating label sets and adding labels.
When the StartMLLabelingSetGenerationTaskRun
finishes, Glue will have generated a "labeling set" or a set of questions for humans to answer.
In the case of the FindMatches
transform, these questions are of the form, “What is the correct way to group these rows together into groups composed entirely of matching records?”
After the labeling process is finished, you can upload your labels with a call to StartImportLabelsTaskRun
. After StartImportLabelsTaskRun
finishes, all future runs of the machine learning transform will use the new and improved labels and perform a higher-quality transformation.
StartTrigger
Each argument is described in detail in: Paws::Glue::StartTrigger
Returns: a Paws::Glue::StartTriggerResponse instance
Starts an existing trigger. See Triggering Jobs (https://docs.aws.amazon.com/glue/latest/dg/trigger-job.html) for information about how different types of trigger are started.
StartWorkflowRun
Each argument is described in detail in: Paws::Glue::StartWorkflowRun
Returns: a Paws::Glue::StartWorkflowRunResponse instance
Starts a new run of the specified workflow.
StopCrawler
Each argument is described in detail in: Paws::Glue::StopCrawler
Returns: a Paws::Glue::StopCrawlerResponse instance
If the specified crawler is running, stops the crawl.
StopCrawlerSchedule
Each argument is described in detail in: Paws::Glue::StopCrawlerSchedule
Returns: a Paws::Glue::StopCrawlerScheduleResponse instance
Sets the schedule state of the specified crawler to NOT_SCHEDULED
, but does not stop the crawler if it is already running.
StopTrigger
Each argument is described in detail in: Paws::Glue::StopTrigger
Returns: a Paws::Glue::StopTriggerResponse instance
Stops a specified trigger.
StopWorkflowRun
Each argument is described in detail in: Paws::Glue::StopWorkflowRun
Returns: a Paws::Glue::StopWorkflowRunResponse instance
Stops the execution of the specified workflow run.
TagResource
- ResourceArn => Str
- TagsToAdd => Paws::Glue::TagsMap
Each argument is described in detail in: Paws::Glue::TagResource
Returns: a Paws::Glue::TagResourceResponse instance
Adds tags to a resource. A tag is a label you can assign to an Amazon Web Services resource. In Glue, you can tag only certain resources. For information about what resources you can tag, see Amazon Web Services Tags in Glue (https://docs.aws.amazon.com/glue/latest/dg/monitor-tags.html).
UntagResource
Each argument is described in detail in: Paws::Glue::UntagResource
Returns: a Paws::Glue::UntagResourceResponse instance
Removes tags from a resource.
UpdateClassifier
- [CsvClassifier => Paws::Glue::UpdateCsvClassifierRequest]
- [GrokClassifier => Paws::Glue::UpdateGrokClassifierRequest]
- [JsonClassifier => Paws::Glue::UpdateJsonClassifierRequest]
- [XMLClassifier => Paws::Glue::UpdateXMLClassifierRequest]
Each argument is described in detail in: Paws::Glue::UpdateClassifier
Returns: a Paws::Glue::UpdateClassifierResponse instance
Modifies an existing classifier (a GrokClassifier
, an XMLClassifier
, a JsonClassifier
, or a CsvClassifier
, depending on which field is present).
UpdateColumnStatisticsForPartition
- ColumnStatisticsList => ArrayRef[Paws::Glue::ColumnStatistics]
- DatabaseName => Str
- PartitionValues => ArrayRef[Str|Undef]
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::UpdateColumnStatisticsForPartition
Returns: a Paws::Glue::UpdateColumnStatisticsForPartitionResponse instance
Creates or updates partition statistics of columns.
The Identity and Access Management (IAM) permission required for this operation is UpdatePartition
.
UpdateColumnStatisticsForTable
- ColumnStatisticsList => ArrayRef[Paws::Glue::ColumnStatistics]
- DatabaseName => Str
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::UpdateColumnStatisticsForTable
Returns: a Paws::Glue::UpdateColumnStatisticsForTableResponse instance
Creates or updates table statistics of columns.
The Identity and Access Management (IAM) permission required for this operation is UpdateTable
.
UpdateConnection
- ConnectionInput => Paws::Glue::ConnectionInput
- Name => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::UpdateConnection
Returns: a Paws::Glue::UpdateConnectionResponse instance
Updates a connection definition in the Data Catalog.
UpdateCrawler
- Name => Str
- [Classifiers => ArrayRef[Str|Undef]]
- [Configuration => Str]
- [CrawlerSecurityConfiguration => Str]
- [DatabaseName => Str]
- [Description => Str]
- [LineageConfiguration => Paws::Glue::LineageConfiguration]
- [RecrawlPolicy => Paws::Glue::RecrawlPolicy]
- [Role => Str]
- [Schedule => Str]
- [SchemaChangePolicy => Paws::Glue::SchemaChangePolicy]
- [TablePrefix => Str]
- [Targets => Paws::Glue::CrawlerTargets]
Each argument is described in detail in: Paws::Glue::UpdateCrawler
Returns: a Paws::Glue::UpdateCrawlerResponse instance
Updates a crawler. If a crawler is running, you must stop it using StopCrawler
before updating it.
UpdateCrawlerSchedule
Each argument is described in detail in: Paws::Glue::UpdateCrawlerSchedule
Returns: a Paws::Glue::UpdateCrawlerScheduleResponse instance
Updates the schedule of a crawler using a cron
expression.
UpdateDatabase
- DatabaseInput => Paws::Glue::DatabaseInput
- Name => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::UpdateDatabase
Returns: a Paws::Glue::UpdateDatabaseResponse instance
Updates an existing database definition in a Data Catalog.
UpdateDevEndpoint
- EndpointName => Str
- [AddArguments => Paws::Glue::MapValue]
- [AddPublicKeys => ArrayRef[Str|Undef]]
- [CustomLibraries => Paws::Glue::DevEndpointCustomLibraries]
- [DeleteArguments => ArrayRef[Str|Undef]]
- [DeletePublicKeys => ArrayRef[Str|Undef]]
- [PublicKey => Str]
- [UpdateEtlLibraries => Bool]
Each argument is described in detail in: Paws::Glue::UpdateDevEndpoint
Returns: a Paws::Glue::UpdateDevEndpointResponse instance
Updates a specified development endpoint.
UpdateJob
- JobName => Str
- JobUpdate => Paws::Glue::JobUpdate
Each argument is described in detail in: Paws::Glue::UpdateJob
Returns: a Paws::Glue::UpdateJobResponse instance
Updates an existing job definition.
UpdateMLTransform
- TransformId => Str
- [Description => Str]
- [GlueVersion => Str]
- [MaxCapacity => Num]
- [MaxRetries => Int]
- [Name => Str]
- [NumberOfWorkers => Int]
- [Parameters => Paws::Glue::TransformParameters]
- [Role => Str]
- [Timeout => Int]
- [WorkerType => Str]
Each argument is described in detail in: Paws::Glue::UpdateMLTransform
Returns: a Paws::Glue::UpdateMLTransformResponse instance
Updates an existing machine learning transform. Call this operation to tune the algorithm parameters to achieve better results.
After calling this operation, you can call the StartMLEvaluationTaskRun
operation to assess how well your new parameters achieved your goals (such as improving the quality of your machine learning transform, or making it more cost-effective).
UpdatePartition
- DatabaseName => Str
- PartitionInput => Paws::Glue::PartitionInput
- PartitionValueList => ArrayRef[Str|Undef]
- TableName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::UpdatePartition
Returns: a Paws::Glue::UpdatePartitionResponse instance
Updates a partition.
UpdateRegistry
- Description => Str
- RegistryId => Paws::Glue::RegistryId
Each argument is described in detail in: Paws::Glue::UpdateRegistry
Returns: a Paws::Glue::UpdateRegistryResponse instance
Updates an existing registry which is used to hold a collection of schemas. The updated properties relate to the registry, and do not modify any of the schemas within the registry.
UpdateSchema
- SchemaId => Paws::Glue::SchemaId
- [Compatibility => Str]
- [Description => Str]
- [SchemaVersionNumber => Paws::Glue::SchemaVersionNumber]
Each argument is described in detail in: Paws::Glue::UpdateSchema
Returns: a Paws::Glue::UpdateSchemaResponse instance
Updates the description, compatibility setting, or version checkpoint for a schema set.
For updating the compatibility setting, the call will not validate compatibility for the entire set of schema versions with the new compatibility setting. If the value for Compatibility
is provided, the VersionNumber
(a checkpoint) is also required. The API will validate the checkpoint version number for consistency.
If the value for the VersionNumber
(checkpoint) is provided, Compatibility
is optional and this can be used to set/reset a checkpoint for the schema.
This update will happen only if the schema is in the AVAILABLE state.
UpdateTable
- DatabaseName => Str
- TableInput => Paws::Glue::TableInput
- [CatalogId => Str]
- [SkipArchive => Bool]
Each argument is described in detail in: Paws::Glue::UpdateTable
Returns: a Paws::Glue::UpdateTableResponse instance
Updates a metadata table in the Data Catalog.
UpdateTrigger
- Name => Str
- TriggerUpdate => Paws::Glue::TriggerUpdate
Each argument is described in detail in: Paws::Glue::UpdateTrigger
Returns: a Paws::Glue::UpdateTriggerResponse instance
Updates a trigger definition.
UpdateUserDefinedFunction
- DatabaseName => Str
- FunctionInput => Paws::Glue::UserDefinedFunctionInput
- FunctionName => Str
- [CatalogId => Str]
Each argument is described in detail in: Paws::Glue::UpdateUserDefinedFunction
Returns: a Paws::Glue::UpdateUserDefinedFunctionResponse instance
Updates an existing function definition in the Data Catalog.
UpdateWorkflow
- Name => Str
- [DefaultRunProperties => Paws::Glue::WorkflowRunProperties]
- [Description => Str]
- [MaxConcurrentRuns => Int]
Each argument is described in detail in: Paws::Glue::UpdateWorkflow
Returns: a Paws::Glue::UpdateWorkflowResponse instance
Updates an existing workflow.
PAGINATORS
Paginator methods are helpers that repetively call methods that return partial results
GetAllClassifiers(sub { },[MaxResults => Int, NextToken => Str])
GetAllClassifiers([MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- Classifiers, passing the object as the first parameter, and the string 'Classifiers' as the second parameter
If not, it will return a a Paws::Glue::GetClassifiersResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllConnections(sub { },[CatalogId => Str, Filter => Paws::Glue::GetConnectionsFilter, HidePassword => Bool, MaxResults => Int, NextToken => Str])
GetAllConnections([CatalogId => Str, Filter => Paws::Glue::GetConnectionsFilter, HidePassword => Bool, MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- ConnectionList, passing the object as the first parameter, and the string 'ConnectionList' as the second parameter
If not, it will return a a Paws::Glue::GetConnectionsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllCrawlerMetrics(sub { },[CrawlerNameList => ArrayRef[Str|Undef], MaxResults => Int, NextToken => Str])
GetAllCrawlerMetrics([CrawlerNameList => ArrayRef[Str|Undef], MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- CrawlerMetricsList, passing the object as the first parameter, and the string 'CrawlerMetricsList' as the second parameter
If not, it will return a a Paws::Glue::GetCrawlerMetricsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllCrawlers(sub { },[MaxResults => Int, NextToken => Str])
GetAllCrawlers([MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- Crawlers, passing the object as the first parameter, and the string 'Crawlers' as the second parameter
If not, it will return a a Paws::Glue::GetCrawlersResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllDatabases(sub { },[CatalogId => Str, MaxResults => Int, NextToken => Str, ResourceShareType => Str])
GetAllDatabases([CatalogId => Str, MaxResults => Int, NextToken => Str, ResourceShareType => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- DatabaseList, passing the object as the first parameter, and the string 'DatabaseList' as the second parameter
If not, it will return a a Paws::Glue::GetDatabasesResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllDevEndpoints(sub { },[MaxResults => Int, NextToken => Str])
GetAllDevEndpoints([MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- DevEndpoints, passing the object as the first parameter, and the string 'DevEndpoints' as the second parameter
If not, it will return a a Paws::Glue::GetDevEndpointsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllJobRuns(sub { },JobName => Str, [MaxResults => Int, NextToken => Str])
GetAllJobRuns(JobName => Str, [MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- JobRuns, passing the object as the first parameter, and the string 'JobRuns' as the second parameter
If not, it will return a a Paws::Glue::GetJobRunsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllJobs(sub { },[MaxResults => Int, NextToken => Str])
GetAllJobs([MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- Jobs, passing the object as the first parameter, and the string 'Jobs' as the second parameter
If not, it will return a a Paws::Glue::GetJobsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllPartitionIndexes(sub { },DatabaseName => Str, TableName => Str, [CatalogId => Str, NextToken => Str])
GetAllPartitionIndexes(DatabaseName => Str, TableName => Str, [CatalogId => Str, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- PartitionIndexDescriptorList, passing the object as the first parameter, and the string 'PartitionIndexDescriptorList' as the second parameter
If not, it will return a a Paws::Glue::GetPartitionIndexesResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllPartitions(sub { },DatabaseName => Str, TableName => Str, [CatalogId => Str, ExcludeColumnSchema => Bool, Expression => Str, MaxResults => Int, NextToken => Str, Segment => Paws::Glue::Segment])
GetAllPartitions(DatabaseName => Str, TableName => Str, [CatalogId => Str, ExcludeColumnSchema => Bool, Expression => Str, MaxResults => Int, NextToken => Str, Segment => Paws::Glue::Segment])
If passed a sub as first parameter, it will call the sub for each element found in :
- Partitions, passing the object as the first parameter, and the string 'Partitions' as the second parameter
If not, it will return a a Paws::Glue::GetPartitionsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllResourcePolicies(sub { },[MaxResults => Int, NextToken => Str])
GetAllResourcePolicies([MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- GetResourcePoliciesResponseList, passing the object as the first parameter, and the string 'GetResourcePoliciesResponseList' as the second parameter
If not, it will return a a Paws::Glue::GetResourcePoliciesResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllSecurityConfigurations(sub { },[MaxResults => Int, NextToken => Str])
GetAllSecurityConfigurations([MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- SecurityConfigurations, passing the object as the first parameter, and the string 'SecurityConfigurations' as the second parameter
If not, it will return a a Paws::Glue::GetSecurityConfigurationsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllTables(sub { },DatabaseName => Str, [CatalogId => Str, Expression => Str, MaxResults => Int, NextToken => Str])
GetAllTables(DatabaseName => Str, [CatalogId => Str, Expression => Str, MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- TableList, passing the object as the first parameter, and the string 'TableList' as the second parameter
If not, it will return a a Paws::Glue::GetTablesResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllTableVersions(sub { },DatabaseName => Str, TableName => Str, [CatalogId => Str, MaxResults => Int, NextToken => Str])
GetAllTableVersions(DatabaseName => Str, TableName => Str, [CatalogId => Str, MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- TableVersions, passing the object as the first parameter, and the string 'TableVersions' as the second parameter
If not, it will return a a Paws::Glue::GetTableVersionsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllTriggers(sub { },[DependentJobName => Str, MaxResults => Int, NextToken => Str])
GetAllTriggers([DependentJobName => Str, MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- Triggers, passing the object as the first parameter, and the string 'Triggers' as the second parameter
If not, it will return a a Paws::Glue::GetTriggersResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
GetAllUserDefinedFunctions(sub { },Pattern => Str, [CatalogId => Str, DatabaseName => Str, MaxResults => Int, NextToken => Str])
GetAllUserDefinedFunctions(Pattern => Str, [CatalogId => Str, DatabaseName => Str, MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- UserDefinedFunctions, passing the object as the first parameter, and the string 'UserDefinedFunctions' as the second parameter
If not, it will return a a Paws::Glue::GetUserDefinedFunctionsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
ListAllRegistries(sub { },[MaxResults => Int, NextToken => Str])
ListAllRegistries([MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- Registries, passing the object as the first parameter, and the string 'Registries' as the second parameter
If not, it will return a a Paws::Glue::ListRegistriesResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
ListAllSchemas(sub { },[MaxResults => Int, NextToken => Str, RegistryId => Paws::Glue::RegistryId])
ListAllSchemas([MaxResults => Int, NextToken => Str, RegistryId => Paws::Glue::RegistryId])
If passed a sub as first parameter, it will call the sub for each element found in :
- Schemas, passing the object as the first parameter, and the string 'Schemas' as the second parameter
If not, it will return a a Paws::Glue::ListSchemasResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
ListAllSchemaVersions(sub { },SchemaId => Paws::Glue::SchemaId, [MaxResults => Int, NextToken => Str])
ListAllSchemaVersions(SchemaId => Paws::Glue::SchemaId, [MaxResults => Int, NextToken => Str])
If passed a sub as first parameter, it will call the sub for each element found in :
- Schemas, passing the object as the first parameter, and the string 'Schemas' as the second parameter
If not, it will return a a Paws::Glue::ListSchemaVersionsResponse instance with all the param
s; from all the responses. Please take into account that this mode can potentially consume vasts ammounts of memory.
SEE ALSO
This service class forms part of Paws
BUGS and CONTRIBUTIONS
The source code is located here: https://github.com/pplu/aws-sdk-perl
Please report bugs to: https://github.com/pplu/aws-sdk-perl/issues