airflow.providers.amazon.aws.hooks.glue
¶
Module Contents¶
Classes¶
Interact with AWS Glue - create job, trigger, crawler |
|
This hook is deprecated. |
- class airflow.providers.amazon.aws.hooks.glue.GlueJobHook(s3_bucket: Optional[str] = None, job_name: Optional[str] = None, desc: Optional[str] = None, concurrent_run_limit: int = 1, script_location: Optional[str] = None, retry_limit: int = 0, num_of_dpus: Optional[int] = None, iam_role_name: Optional[str] = None, create_job_kwargs: Optional[dict] = None, *args, **kwargs)[source]¶
Bases:
airflow.providers.amazon.aws.hooks.base_aws.AwsBaseHook
Interact with AWS Glue - create job, trigger, crawler
- Parameters
s3_bucket (Optional[str]) -- S3 bucket where logs and local etl script will be uploaded
job_name (Optional[str]) -- unique job name per AWS account
desc (Optional[str]) -- job description
concurrent_run_limit (int) -- The maximum number of concurrent runs allowed for a job
script_location (Optional[str]) -- path to etl script on s3
retry_limit (int) -- Maximum number of times to retry this job if it fails
num_of_dpus (int) -- Number of AWS Glue DPUs to allocate to this Job
region_name (Optional[str]) -- aws region name (example: us-east-1)
iam_role_name (Optional[str]) -- AWS IAM Role for Glue Job Execution
create_job_kwargs (Optional[dict]) -- Extra arguments for Glue Job Creation
- initialize_job(self, script_arguments: Optional[dict] = None, run_kwargs: Optional[dict] = None) Dict[str, str] [source]¶
Initializes connection with AWS Glue to run job :return:
- get_job_state(self, job_name: str, run_id: str) str [source]¶
Get state of the Glue job. The job state can be running, finished, failed, stopped or timeout. :param job_name: unique job name per AWS account :type job_name: str :param run_id: The job-run ID of the predecessor job run :type run_id: str :return: State of the Glue job
- job_completion(self, job_name: str, run_id: str) Dict[str, str] [source]¶
Waits until Glue job with job_name completes or fails and return final state if finished. Raises AirflowException when the job failed :param job_name: unique job name per AWS account :type job_name: str :param run_id: The job-run ID of the predecessor job run :type run_id: str :return: Dict of JobRunState and JobRunId
- class airflow.providers.amazon.aws.hooks.glue.AwsGlueJobHook(*args, **kwargs)[source]¶
Bases:
GlueJobHook
This hook is deprecated. Please use
airflow.providers.amazon.aws.hooks.glue.GlueJobHook
.