airflow.providers.amazon.aws.sensors.emr¶
Module Contents¶
Classes¶
| Contains general sensor behavior for EMR. | |
| Poll the state of the job run until it reaches a terminal state; fails if the job run fails. | |
| Poll the state of the application until it reaches a terminal state; fails if the application fails. | |
| Poll the state of the job run until it reaches a terminal state; fail if the job run fails. | |
| Poll the EMR notebook until it reaches any of the target states; raise AirflowException on failure. | |
| Poll the EMR JobFlow Cluster until it reaches any of the target states; raise AirflowException on failure. | |
| Poll the state of the step until it reaches any of the target states; raise AirflowException on failure. | 
- class airflow.providers.amazon.aws.sensors.emr.EmrBaseSensor(*, aws_conn_id='aws_default', **kwargs)[source]¶
- Bases: - airflow.sensors.base.BaseSensorOperator- Contains general sensor behavior for EMR. - Subclasses should implement following methods:
- get_emr_response()
- state_from_response()
- failure_message_from_response()
 
 - Subclasses should set - target_statesand- failed_statesfields.- Parameters
- aws_conn_id (str) – aws connection to use 
 
- class airflow.providers.amazon.aws.sensors.emr.EmrServerlessJobSensor(*, application_id, job_run_id, target_states=frozenset(EmrServerlessHook.JOB_SUCCESS_STATES), aws_conn_id='aws_default', **kwargs)[source]¶
- Bases: - airflow.sensors.base.BaseSensorOperator- Poll the state of the job run until it reaches a terminal state; fails if the job run fails. - See also - For more information on how to use this sensor, take a look at the guide: Wait on an EMR Serverless Job state - Parameters
 
- class airflow.providers.amazon.aws.sensors.emr.EmrServerlessApplicationSensor(*, application_id, target_states=frozenset(EmrServerlessHook.APPLICATION_SUCCESS_STATES), aws_conn_id='aws_default', **kwargs)[source]¶
- Bases: - airflow.sensors.base.BaseSensorOperator- Poll the state of the application until it reaches a terminal state; fails if the application fails. - See also - For more information on how to use this sensor, take a look at the guide: Wait on an EMR Serverless Application state - Parameters
 
- class airflow.providers.amazon.aws.sensors.emr.EmrContainerSensor(*, virtual_cluster_id, job_id, max_retries=None, aws_conn_id='aws_default', poll_interval=10, deferrable=conf.getboolean('operators', 'default_deferrable', fallback=False), **kwargs)[source]¶
- Bases: - airflow.sensors.base.BaseSensorOperator- Poll the state of the job run until it reaches a terminal state; fail if the job run fails. - See also - For more information on how to use this sensor, take a look at the guide: Wait on an Amazon EMR virtual cluster job - Parameters
- job_id (str) – job_id to check the state of 
- max_retries (int | None) – Number of times to poll for query state before returning the current state, defaults to None 
- aws_conn_id (str) – aws connection to use, defaults to ‘aws_default’ 
- poll_interval (int) – Time in seconds to wait between two consecutive call to check query status on athena, defaults to 10 
- deferrable (bool) – Run sensor in the deferrable mode. 
 
 
- class airflow.providers.amazon.aws.sensors.emr.EmrNotebookExecutionSensor(notebook_execution_id, target_states=None, failed_states=None, **kwargs)[source]¶
- Bases: - EmrBaseSensor- Poll the EMR notebook until it reaches any of the target states; raise AirflowException on failure. - See also - For more information on how to use this sensor, take a look at the guide: Wait on an EMR notebook execution state - Parameters
- notebook_execution_id (str) – Unique id of the notebook execution to be poked. 
- Target_states
- the states the sensor will wait for the execution to reach. Default target_states is - FINISHED.
- Failed_states
- if the execution reaches any of the failed_states, the sensor will fail. Default failed_states is - FAILED.
 
- class airflow.providers.amazon.aws.sensors.emr.EmrJobFlowSensor(*, job_flow_id, target_states=None, failed_states=None, max_attempts=60, deferrable=conf.getboolean('operators', 'default_deferrable', fallback=False), **kwargs)[source]¶
- Bases: - EmrBaseSensor- Poll the EMR JobFlow Cluster until it reaches any of the target states; raise AirflowException on failure. - With the default target states, sensor waits cluster to be terminated. When target_states is set to [‘RUNNING’, ‘WAITING’] sensor waits until job flow to be ready (after ‘STARTING’ and ‘BOOTSTRAPPING’ states) - See also - For more information on how to use this sensor, take a look at the guide: Wait on an Amazon EMR job flow state - Parameters
- job_flow_id (str) – job_flow_id to check the state of 
- target_states (Iterable[str] | None) – the target states, sensor waits until job flow reaches any of these states. In deferrable mode it would run until reach the terminal state. 
- failed_states (Iterable[str] | None) – the failure states, sensor fails when job flow reaches any of these states 
- max_attempts (int) – Maximum number of tries before failing 
- deferrable (bool) – Run sensor in the deferrable mode. 
 
 - static failure_message_from_response(response)[source]¶
- Get failure message from response dictionary. 
 
- class airflow.providers.amazon.aws.sensors.emr.EmrStepSensor(*, job_flow_id, step_id, target_states=None, failed_states=None, max_attempts=60, deferrable=conf.getboolean('operators', 'default_deferrable', fallback=False), **kwargs)[source]¶
- Bases: - EmrBaseSensor- Poll the state of the step until it reaches any of the target states; raise AirflowException on failure. - With the default target states, sensor waits step to be completed. - See also - For more information on how to use this sensor, take a look at the guide: Wait on an Amazon EMR step state - Parameters
- job_flow_id (str) – job_flow_id which contains the step check the state of 
- step_id (str) – step to check the state of 
- target_states (Iterable[str] | None) – the target states, sensor waits until step reaches any of these states. In case of deferrable sensor it will for reach to terminal state 
- failed_states (Iterable[str] | None) – the failure states, sensor fails when step reaches any of these states 
- max_attempts (int) – Maximum number of tries before failing 
- deferrable (bool) – Run sensor in the deferrable mode. 
 
 - template_fields: Sequence[str] = ('job_flow_id', 'step_id', 'target_states', 'failed_states')[source]¶
 - get_emr_response(context)[source]¶
- Make an API call with boto3 and get details about the cluster step. 
 - static failure_message_from_response(response)[source]¶
- Get failure message from response dictionary. 
 
