Complete the airflow survey & get a free airflow 3 certification!

airflow.providers.google.cloud.log.gcs_task_handler

Attributes

logger

Classes

GCSRemoteLogIO

Convenience super-class to have a logger configured with the class name.

GCSTaskHandler

GCSTaskHandler is a python log handler that handles and reads task instance logs.

Module Contents

airflow.providers.google.cloud.log.gcs_task_handler.logger[source]
class airflow.providers.google.cloud.log.gcs_task_handler.GCSRemoteLogIO(context=None)[source]

Bases: airflow.utils.log.logging_mixin.LoggingMixin

Convenience super-class to have a logger configured with the class name.

remote_base: str[source]
base_log_folder: pathlib.Path[source]
delete_local_copy: bool[source]
project_id: str | None = None[source]
gcp_key_path: str | None = None[source]
gcp_keyfile_dict: dict | None = None[source]
scopes: collections.abc.Collection[str] | None[source]
processors = ()[source]
upload(path, ti)[source]

Upload the given log path to the remote storage.

property hook: airflow.providers.google.cloud.hooks.gcs.GCSHook | None[source]

Returns GCSHook if remote_log_conn_id configured.

property client: google.cloud.storage.Client[source]

Returns GCS Client.

write(log, remote_log_location)[source]

Write the log to the remote location and return True; fail silently and return False on error.

Parameters:
  • log (str) – the log to write to the remote_log_location

  • remote_log_location (str) – the log’s location in remote storage

Returns:

whether the log is successfully written to remote location or not.

Return type:

bool

static no_log_found(exc)[source]

Given exception, determine whether it is result of log not found.

read(relative_path, ti)[source]
class airflow.providers.google.cloud.log.gcs_task_handler.GCSTaskHandler(*, base_log_folder, gcs_log_folder, gcp_key_path=None, gcp_keyfile_dict=None, gcp_scopes=_DEFAULT_SCOPESS, project_id=PROVIDE_PROJECT_ID, max_bytes=0, backup_count=0, delay=False, **kwargs)[source]

Bases: airflow.utils.log.file_task_handler.FileTaskHandler, airflow.utils.log.logging_mixin.LoggingMixin

GCSTaskHandler is a python log handler that handles and reads task instance logs.

It extends airflow FileTaskHandler and uploads to and reads from GCS remote storage. Upon log reading failure, it reads from host machine’s local disk.

Parameters:
  • base_log_folder (str) – Base log folder to place logs.

  • gcs_log_folder (str) – Path to a remote location where logs will be saved. It must have the prefix gs://. For example: gs://bucket/remote/log/location

  • filename_template – template filename string

  • gcp_key_path (str | None) – Path to Google Cloud Service Account file (JSON). Mutually exclusive with gcp_keyfile_dict. If omitted, authorization based on the Application Default Credentials will be used.

  • gcp_keyfile_dict (dict | None) – Dictionary of keyfile parameters. Mutually exclusive with gcp_key_path.

  • gcp_scopes (collections.abc.Collection[str] | None) – Comma-separated string containing OAuth2 scopes

  • project_id (str) – Project ID to read the secrets from. If not passed, the project ID from credentials will be used.

  • delete_local_copy – Whether local log files should be deleted after they are downloaded when using remote logging

trigger_should_wrap = True[source]
handler: logging.FileHandler | None = None[source]
log_relative_path = ''[source]
closed = False[source]
upload_on_close = True[source]
io[source]
set_context(ti, *, identifier=None)[source]

Provide task_instance context to airflow task handler.

Generally speaking returns None. But if attr maintain_propagate has been set to propagate, then returns sentinel MAINTAIN_PROPAGATE. This has the effect of overriding the default behavior to set propagate to False whenever set_context is called. At time of writing, this functionality is only used in unit testing.

Parameters:
  • ti (airflow.models.taskinstance.TaskInstance) – task instance object

  • identifier (str | None) – if set, adds suffix to log file. For use when relaying exceptional messages to task logs from a context other than task or trigger run

close()[source]

Close and upload local log file to remote storage GCS.

Was this entry helpful?