airflow.providers.google.cloud.hooks.kubernetes_engine

This module contains a Google Kubernetes Engine Hook.

Module Contents

Classes

GKEHook

Google Kubernetes Engine cluster APIs.

GKEDeploymentHook

Google Kubernetes Engine Deployment APIs.

GKEAsyncHook

Asynchronous client of GKE.

GKEPodHook

Google Kubernetes Engine pod APIs.

GKEJobHook

Google Kubernetes Engine Job APIs.

GKEPodAsyncHook

Asynchronous Google Kubernetes Engine pod APIs.

Attributes

OPERATIONAL_POLL_INTERVAL

airflow.providers.google.cloud.hooks.kubernetes_engine.OPERATIONAL_POLL_INTERVAL = 15[source]
class airflow.providers.google.cloud.hooks.kubernetes_engine.GKEHook(gcp_conn_id='google_cloud_default', location=None, impersonation_chain=None, **kwargs)[source]

Bases: airflow.providers.google.common.hooks.base_google.GoogleBaseHook

Google Kubernetes Engine cluster APIs.

All the methods in the hook where project_id is used must be called with keyword arguments rather than positional.

get_cluster_manager_client()[source]

Create or get a ClusterManagerClient.

get_conn()[source]

Return connection for the hook.

get_client()[source]
wait_for_operation(operation, project_id=None)[source]

Continuously fetch the status from Google Cloud.

This is done until the given operation completes, or raises an error.

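Parameters
  • operation (google.cloud.container_v1.types.Operation) – The operation to wait for.

  • project_id (str | None) – Google Cloud project ID.

Returns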

A new, updated operation fetched from Google Cloud.

Return type

google.cloud.container_v1.types.Operation
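The polling behaviour described for wait_for_operation can be sketched roughly as follows. This is a minimal illustration and not the hook's actual implementation: FakeOperation, poll_until_done, the fetch callable, and the plain-string statuses are hypothetical stand-ins, and the default interval simply mirrors the module-level OPERATIONAL_POLL_INTERVAL value of 15 seconds.

```python
import time
from dataclasses import dataclass

OPERATIONAL_POLL_INTERVAL = 15  # seconds; mirrors the module-level attribute


@dataclass
class FakeOperation:
    """Hypothetical stand-in for google.cloud.container_v1.types.Operation."""

    name: str
    status: str  # assumed values for this sketch: "PENDING", "RUNNING", "DONE"


def poll_until_done(operation, fetch, interval=OPERATIONAL_POLL_INTERVAL, sleep=time.sleep):
    """Re-fetch the operation until it stops being pending or running.

    `fetch` is a hypothetical callable that takes an operation name and
    returns a fresh operation object (the role get_operation plays in the hook).
    """
    while operation.status in ("PENDING", "RUNNING"):
        sleep(interval)
        operation = fetch(operation.name)
    if operation.status != "DONE":
        # Any other terminal state is treated as an error in this sketch.
        raise RuntimeError(f"Operation {operation.name} ended with status {operation.status}")
    return operation
```

Passing sleep as a parameter keeps the loop testable without real waiting; in production code the default time.sleep would be used.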

get_operation(operation_name, project_id=None)[source]

Get an operation from Google Cloud.

Parameters
  • operation_name (str) – Name of operation to fetch

  • project_id (str | None) – Google Cloud project ID

Returns

The new, updated operation from Google Cloud

Return type

google.cloud.container_v1.types.Operation

delete_cluster(name, project_id=PROVIDE_PROJECT_ID, wait_to_complete=True, retry=DEFAULT, timeout=None)[source]

Delete the cluster, the Kubernetes endpoint, and all worker nodes.

Firewalls and routes that were configured during cluster creation are also deleted. Other Google Compute Engine resources that might be in use by the cluster (e.g. load balancer resources) will not be deleted if they were not present at the initial create time.

Parameters
  • name (str) – The name of the cluster to delete.

  • project_id (str) – Google Cloud project ID.

  • wait_to_complete (bool) – If True, wait until the deletion is finished before returning.

  • retry (google.api_core.retry.Retry | google.api_core.gapic_v1.method._MethodDefault) – Retry object used to determine when/if to retry requests. If None is specified, requests will not be retried.

  • timeout (float | None) – The amount of time, in seconds, to wait for the request to complete. Note that if retry is specified, the timeout applies to each individual attempt.

Returns

The full url to the delete operation if successful, else None.

Return type

google.cloud.container_v1.types.Operation | None
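As a hedged usage sketch, the helper below calls delete_cluster with project_id passed by keyword, as the class description requires. The helper name delete_gke_cluster is hypothetical, and hook is assumed to be any object that behaves like GKEHook, which keeps the example runnable without a Google Cloud connection.

```python
def delete_gke_cluster(hook, cluster_name, project_id):
    """Delete a cluster and block until the deletion has finished.

    `hook` is assumed to expose delete_cluster() with the signature
    documented above, for example a GKEHook instance.
    """
    return hook.delete_cluster(
        name=cluster_name,
        project_id=project_id,  # keyword argument, never positional
        wait_to_complete=True,
    )
```

With a real hook this might be invoked as delete_gke_cluster(GKEHook(location="us-central1"), "demo-cluster", "my-project"), where the location, cluster name, and project ID are placeholder values.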

create_cluster(cluster, project_id=PROVIDE_PROJECT_ID, wait_to_complete=True, retry=DEFAULT, timeout=None)[source]

Create a cluster.

The cluster consists of the specified number and type of Google Compute Engine instances.

Parameters
  • cluster (dict | google.cloud.container_v1.types.Cluster) – A Cluster protobuf or dict. If dict is provided, it must be of the same form as the protobuf message google.cloud.container_v1.types.Cluster.

  • project_id (str) – Google Cloud project ID.

  • wait_to_complete (bool) – If True, wait until the creation is finished before returning.

  • retry (google.api_core.retry.Retry | google.api_core.gapic_v1.method._MethodDefault) – A retry object (google.api_core.retry.Retry) used to retry requests. If None is specified, requests will not be retried.

  • timeout (float | None) – The amount of time, in seconds, to wait for the request to complete. Note that if retry is specified, the timeout applies to each individual attempt.

Returns

The full url to the new, or existing, cluster.

Raises
  • ParseError – On JSON parsing problems when trying to convert dict.

  • AirflowException – If cluster is neither a dict nor a Cluster protobuf.

Return type

google.cloud.container_v1.types.Operation | google.cloud.container_v1.types.Cluster
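A dict passed as cluster has to follow the shape of the Cluster protobuf message. The sketch below builds a deliberately small spec using the name and initial_node_count fields and hands it to create_cluster. The helper names build_cluster_spec and create_gke_cluster are hypothetical, and hook is assumed to behave like GKEHook; a real cluster definition would usually carry more fields.

```python
def build_cluster_spec(name, node_count=1):
    """Return a dict shaped like google.cloud.container_v1.types.Cluster."""
    if node_count < 1:
        raise ValueError("node_count must be at least 1")
    return {"name": name, "initial_node_count": node_count}


def create_gke_cluster(hook, project_id, name, node_count=1):
    """Create a cluster through a GKEHook-like object and wait for completion."""
    return hook.create_cluster(
        cluster=build_cluster_spec(name, node_count),
        project_id=project_id,  # keyword argument, never positional
        wait_to_complete=True,
    )
```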

get_cluster(name, project_id=PROVIDE_PROJECT_ID, retry=DEFAULT, timeout=None)[source]

Get details of specified cluster.

Parameters
  • name (str) – The name of the cluster to retrieve.

  • project_id (str) – Google Cloud project ID.

  • retry (google.api_core.retry.Retry | google.api_core.gapic_v1.method._MethodDefault) – A retry object used to retry requests. If None is specified, requests will not be retried.

  • timeout (float | None) – The amount of time, in seconds, to wait for the request to complete. Note that if retry is specified, the timeout applies to each individual attempt.

check_cluster_autoscaling_ability(cluster)[source]

Check if the specified Cluster has the ability to autoscale.

A Cluster can autoscale if it is an Autopilot cluster, has Node Auto-provisioning enabled, or has regular auto-scaled node pools. Returns True if the Cluster supports autoscaling, otherwise returns False.

Parameters

cluster (google.cloud.container_v1.types.Cluster | dict) – The Cluster object.
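The three conditions can be illustrated on a plain dict. This is a sketch of the decision rule only, not the hook's own code: the function name can_autoscale is hypothetical, and the keys used (autopilot.enabled, autoscaling.enable_node_autoprovisioning, node_pools[].autoscaling.enabled) are assumed to follow the dict form of the Cluster protobuf.

```python
def can_autoscale(cluster: dict) -> bool:
    """Return True if the cluster dict satisfies any of the three conditions."""
    # Condition 1: the cluster runs in Autopilot mode.
    autopilot = (cluster.get("autopilot") or {}).get("enabled", False)
    # Condition 2: node auto-provisioning is enabled at cluster level.
    auto_provisioning = (cluster.get("autoscaling") or {}).get(
        "enable_node_autoprovisioning", False
    )
    # Condition 3: at least one node pool has autoscaling enabled.
    pool_autoscaled = any(
        (pool.get("autoscaling") or {}).get("enabled", False)
        for pool in cluster.get("node_pools") or []
    )
    return bool(autopilot or auto_provisioning or pool_autoscaled)
```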

class airflow.providers.google.cloud.hooks.kubernetes_engine.GKEDeploymentHook(cluster_url, ssl_ca_cert, *args, **kwargs)[source]

Bases: airflow.providers.google.common.hooks.base_google.GoogleBaseHook, airflow.providers.cncf.kubernetes.hooks.kubernetes.KubernetesHook

Google Kubernetes Engine Deployment APIs.

api_client()[source]

Cached Kubernetes API client.

core_v1_client()[source]

Get authenticated client object.

batch_v1_client()[source]
apps_v1_client()[source]
get_conn()[source]

Return connection for the hook.

check_kueue_deployment_running(name, namespace)[source]
class airflow.providers.google.cloud.hooks.kubernetes_engine.GKEAsyncHook(gcp_conn_id='google_cloud_default', location=None, impersonation_chain=None)[source]

Bases: airflow.providers.google.common.hooks.base_google.GoogleBaseAsyncHook

Asynchronous client of GKE.

sync_hook_class[source]
async get_operation(operation_name, project_id=PROVIDE_PROJECT_ID)[source]

Fetch an operation from Google Cloud.

Parameters
  • operation_name (str) – Name of operation to fetch.

  • project_id (str) – Google Cloud project ID.

Returns

The new, updated operation from Google Cloud.

Return type

google.cloud.container_v1.types.Operation
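Because get_operation is a coroutine here, a caller can poll without blocking the event loop. The sketch below is one possible way to do that; the function name wait_for_operation_async is hypothetical, hook is assumed to behave like GKEAsyncHook, and the returned operation is assumed to expose status.name in the style of the Operation.Status enum.

```python
import asyncio


async def wait_for_operation_async(hook, operation_name, project_id, poll_interval=15.0):
    """Poll a GKEAsyncHook-like object until the operation reports DONE."""
    while True:
        operation = await hook.get_operation(
            operation_name=operation_name,
            project_id=project_id,  # keyword argument, never positional
        )
        # Assumption: status is an enum-like object with a `.name` attribute.
        if operation.status.name == "DONE":
            return operation
        # Yield to the event loop between polls instead of blocking.
        await asyncio.sleep(poll_interval)
```

The sketch has no timeout or error-state handling; a production caller would likely bound the loop, for example with asyncio.wait_for.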

class airflow.providers.google.cloud.hooks.kubernetes_engine.GKEPodHook(cluster_url, ssl_ca_cert, disable_tcp_keepalive=None, gcp_conn_id='google_cloud_default', impersonation_chain=None, **kwargs)[source]

Bases: airflow.providers.google.common.hooks.base_google.GoogleBaseHook, airflow.providers.cncf.kubernetes.utils.pod_manager.PodOperatorHookProtocol

Google Kubernetes Engine pod APIs.

property is_in_cluster: bool[source]

Expose whether the hook is configured with load_incluster_config or not.

api_client()[source]
core_v1_client()[source]

Get authenticated client object.

get_namespace()[source]

Get the namespace configured by the Airflow connection.

get_xcom_sidecar_container_image()[source]

Get the xcom sidecar image defined in the connection.

Implemented for compatibility with KubernetesHook.

get_xcom_sidecar_container_resources()[source]

Get the xcom sidecar resources defined in the connection.

Implemented for compatibility with KubernetesHook.

get_conn()[source]

Return connection for the hook.

apply_from_yaml_file(yaml_file=None, yaml_objects=None, verbose=False, namespace='default')[source]

Perform an action from a yaml file on a Pod.

Parameters
  • yaml_file (str | None) – Contains the path to yaml file.

  • yaml_objects (list[dict] | None) – List of YAML objects; used instead of reading the yaml_file.

  • verbose (bool) – If True, print confirmation from create action. Default is False.

  • namespace (str) – The namespace in which to create all resources. The namespace must already exist; otherwise, resource creation will fail.
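When yaml_objects is used instead of yaml_file, each list entry is a manifest already parsed into a dict. The following sketch builds a minimal Pod manifest and passes it along. The helper names build_pod_manifest and apply_pod are hypothetical, hook is assumed to behave like GKEPodHook, and the manifest fields follow the standard Kubernetes Pod schema.

```python
def build_pod_manifest(name, image, namespace="default"):
    """Return a minimal Kubernetes Pod manifest as a dict."""
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            "containers": [{"name": name, "image": image}],
            "restartPolicy": "Never",
        },
    }


def apply_pod(hook, name, image, namespace="default"):
    """Submit the manifest through a GKEPodHook-like object."""
    return hook.apply_from_yaml_file(
        yaml_objects=[build_pod_manifest(name, image, namespace)],
        namespace=namespace,  # the namespace must already exist
    )
```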

get_pod(name, namespace)[source]

Get a pod object.

Parameters
  • name (str) – Name of the pod.

  • namespace (str) – Name of the pod’s namespace.

class airflow.providers.google.cloud.hooks.kubernetes_engine.GKEJobHook(cluster_url, ssl_ca_cert, *args, **kwargs)[source]

Bases: airflow.providers.google.common.hooks.base_google.GoogleBaseHook, airflow.providers.cncf.kubernetes.hooks.kubernetes.KubernetesHook

Google Kubernetes Engine Job APIs.

api_client()[source]

Cached Kubernetes API client.

core_v1_client()[source]

Get authenticated client object.

batch_v1_client()[source]
get_conn()[source]

Return connection for the hook.

class airflow.providers.google.cloud.hooks.kubernetes_engine.GKEPodAsyncHook(cluster_url, ssl_ca_cert, gcp_conn_id='google_cloud_default', impersonation_chain=None, **kwargs)[source]

Bases: airflow.providers.google.common.hooks.base_google.GoogleBaseAsyncHook

Asynchronous Google Kubernetes Engine pod APIs.

Parameters
  • cluster_url (str) – The URL pointing to the cluster.

  • ssl_ca_cert (str) – SSL certificate used for authentication to the pod.

sync_hook_class[source]
scopes = ['https://www.googleapis.com/auth/cloud-platform'][source]
async get_conn(token)[source]
async get_pod(name, namespace)[source]

Get a pod object.

Parameters
  • name (str) – Name of the pod.

  • namespace (str) – Name of the pod’s namespace.

async delete_pod(name, namespace)[source]

Delete a pod.

Parameters
  • name (str) – Name of the pod.

  • namespace (str) – Name of the pod’s namespace.

async read_logs(name, namespace)[source]

Read logs inside the pod while its containers are starting.

Every log line is output with its timestamp so that the logs can be tracked after the pod has finished executing. The method is used for asynchronous log output only if the pod failed its execution or the task was cancelled by the user.

Parameters
  • name (str) – Name of the pod.

  • namespace (str) – Name of the pod’s namespace.
