airflow.providers.amazon.aws.operators.glue_crawler

Module Contents

Classes

GlueCrawlerOperator

Creates, updates and triggers an AWS Glue Crawler. AWS Glue Crawler is a serverless

AwsGlueCrawlerOperator

This operator is deprecated.

class airflow.providers.amazon.aws.operators.glue_crawler.GlueCrawlerOperator(config, aws_conn_id='aws_default', poll_interval=5, wait_for_completion=True, **kwargs)[source]

Bases: airflow.models.BaseOperator

Creates, updates and triggers an AWS Glue Crawler. AWS Glue Crawler is a serverless service that manages a catalog of metadata tables that contain the inferred schema, format and data types of data stores within the AWS cloud.

See also

For more information on how to use this operator, take a look at the guide: AWS Glue Crawler Operator

Parameters
  • config – Configurations for the AWS Glue crawler

  • aws_conn_id – aws connection to use

  • poll_interval (int) – Time (in seconds) to wait between two consecutive calls to check crawler status

  • wait_for_completion (bool) – Whether or not wait for crawl execution completion. (default: True)

ui_color = #ededed[source]
hook(self)[source]

Create and return an GlueCrawlerHook.

execute(self, context)[source]

Executes AWS Glue Crawler from Airflow

Returns

the name of the current glue crawler.

class airflow.providers.amazon.aws.operators.glue_crawler.AwsGlueCrawlerOperator(*args, **kwargs)[source]

Bases: GlueCrawlerOperator

This operator is deprecated. Please use airflow.providers.amazon.aws.operators.glue_crawler.GlueCrawlerOperator.

Was this entry helpful?