airflow.providers.amazon.aws.sensors.emr
¶
Module Contents¶
Classes¶
Contains general sensor behavior for EMR. |
|
Asks for the state of the job run until it reaches a failure state or success state. |
|
Asks for the state of the application until it reaches a failure state or success state. |
|
Asks for the state of the job run until it reaches a failure state or success state. |
|
Polls the state of the EMR notebook execution until it reaches |
|
Asks for the state of the EMR JobFlow (Cluster) until it reaches |
|
Asks for the state of the step until it reaches any of the target states. |
- class airflow.providers.amazon.aws.sensors.emr.EmrBaseSensor(*, aws_conn_id='aws_default', **kwargs)[source]¶
Bases:
airflow.sensors.base.BaseSensorOperator
Contains general sensor behavior for EMR.
- Subclasses should implement following methods:
get_emr_response()
state_from_response()
failure_message_from_response()
Subclasses should set
target_states
andfailed_states
fields.- Parameters
aws_conn_id (str) – aws connection to use
- class airflow.providers.amazon.aws.sensors.emr.EmrServerlessJobSensor(*, application_id, job_run_id, target_states=frozenset(EmrServerlessHook.JOB_SUCCESS_STATES), aws_conn_id='aws_default', **kwargs)[source]¶
Bases:
airflow.sensors.base.BaseSensorOperator
Asks for the state of the job run until it reaches a failure state or success state. If the job run fails, the task will fail.
See also
For more information on how to use this sensor, take a look at the guide: Wait on an EMR Serverless Job state
- Parameters
- class airflow.providers.amazon.aws.sensors.emr.EmrServerlessApplicationSensor(*, application_id, target_states=frozenset(EmrServerlessHook.APPLICATION_SUCCESS_STATES), aws_conn_id='aws_default', **kwargs)[source]¶
Bases:
airflow.sensors.base.BaseSensorOperator
Asks for the state of the application until it reaches a failure state or success state. If the application fails, the task will fail.
See also
For more information on how to use this sensor, take a look at the guide: Wait on an EMR Serverless Application state
- Parameters
- class airflow.providers.amazon.aws.sensors.emr.EmrContainerSensor(*, virtual_cluster_id, job_id, max_retries=None, aws_conn_id='aws_default', poll_interval=10, **kwargs)[source]¶
Bases:
airflow.sensors.base.BaseSensorOperator
Asks for the state of the job run until it reaches a failure state or success state. If the job run fails, the task will fail.
See also
For more information on how to use this sensor, take a look at the guide: Wait on an Amazon EMR virtual cluster job
- Parameters
job_id (str) – job_id to check the state of
max_retries (int | None) – Number of times to poll for query state before returning the current state, defaults to None
aws_conn_id (str) – aws connection to use, defaults to ‘aws_default’
poll_interval (int) – Time in seconds to wait between two consecutive call to check query status on athena, defaults to 10
- class airflow.providers.amazon.aws.sensors.emr.EmrNotebookExecutionSensor(notebook_execution_id, target_states=None, failed_states=None, **kwargs)[source]¶
Bases:
EmrBaseSensor
Polls the state of the EMR notebook execution until it reaches any of the target states. If a failure state is reached, the sensor throws an error, and fails the task.
See also
For more information on how to use this sensor, take a look at the guide: Wait on an EMR notebook execution state
- Parameters
notebook_execution_id (str) – Unique id of the notebook execution to be poked.
- Target_states
the states the sensor will wait for the execution to reach. Default target_states is
FINISHED
.- Failed_states
if the execution reaches any of the failed_states, the sensor will fail. Default failed_states is
FAILED
.
- class airflow.providers.amazon.aws.sensors.emr.EmrJobFlowSensor(*, job_flow_id, target_states=None, failed_states=None, **kwargs)[source]¶
Bases:
EmrBaseSensor
Asks for the state of the EMR JobFlow (Cluster) until it reaches any of the target states. If it fails the sensor errors, failing the task.
With the default target states, sensor waits cluster to be terminated. When target_states is set to [‘RUNNING’, ‘WAITING’] sensor waits until job flow to be ready (after ‘STARTING’ and ‘BOOTSTRAPPING’ states)
See also
For more information on how to use this sensor, take a look at the guide: Wait on an Amazon EMR job flow state
- Parameters
- class airflow.providers.amazon.aws.sensors.emr.EmrStepSensor(*, job_flow_id, step_id, target_states=None, failed_states=None, **kwargs)[source]¶
Bases:
EmrBaseSensor
Asks for the state of the step until it reaches any of the target states. If it fails the sensor errors, failing the task.
With the default target states, sensor waits step to be completed.
See also
For more information on how to use this sensor, take a look at the guide: Wait on an Amazon EMR step state
- Parameters
job_flow_id (str) – job_flow_id which contains the step check the state of
step_id (str) – step to check the state of
target_states (Iterable[str] | None) – the target states, sensor waits until step reaches any of these states
failed_states (Iterable[str] | None) – the failure states, sensor fails when step reaches any of these states
- template_fields: Sequence[str] = ('job_flow_id', 'step_id', 'target_states', 'failed_states')[source]¶
- get_emr_response(context)[source]¶
Make an API call with boto3 and get details about the cluster step.