airflow.providers.amazon.aws.hooks.athena
¶
This module contains AWS Athena hook
Module Contents¶
-
class
airflow.providers.amazon.aws.hooks.athena.
AWSAthenaHook
(*args, sleep_time: int = 30, **kwargs)[source]¶ Bases:
airflow.providers.amazon.aws.hooks.base_aws.AwsBaseHook
Interact with AWS Athena to run, poll queries and return query results
Additional arguments (such as
aws_conn_id
) may be specified and are passed down to the underlying AwsBaseHook.See also
- Parameters
sleep_time (int) – Time (in seconds) to wait between two consecutive calls to check query status on Athena
-
run_query
(self, query: str, query_context: Dict[str, str], result_configuration: Dict[str, Any], client_request_token: Optional[str] = None, workgroup: str = 'primary')[source]¶ Run Presto query on athena with provided config and return submitted query_execution_id
- Parameters
query (str) – Presto query to run
query_context (dict) – Context in which query need to be run
result_configuration (dict) – Dict with path to store results in and config related to encryption
client_request_token (str) – Unique token created by user to avoid multiple executions of same query
workgroup (str) – Athena workgroup name, when not specified, will be ‘primary’
- Returns
str
-
check_query_status
(self, query_execution_id: str)[source]¶ Fetch the status of submitted athena query. Returns None or one of valid query states.
- Parameters
query_execution_id (str) – Id of submitted athena query
- Returns
str
-
get_state_change_reason
(self, query_execution_id: str)[source]¶ Fetch the reason for a state change (e.g. error message). Returns None or reason string.
- Parameters
query_execution_id (str) – Id of submitted athena query
- Returns
str
-
get_query_results
(self, query_execution_id: str, next_token_id: Optional[str] = None, max_results: int = 1000)[source]¶ Fetch submitted athena query results. returns none if query is in intermediate state or failed/cancelled state else dict of query output
-
get_query_results_paginator
(self, query_execution_id: str, max_items: Optional[int] = None, page_size: Optional[int] = None, starting_token: Optional[str] = None)[source]¶ Fetch submitted athena query results. returns none if query is in intermediate state or failed/cancelled state else a paginator to iterate through pages of results. If you wish to get all results at once, call build_full_result() on the returned PageIterator