Module Contents



GCSTaskHandler is a python log handler that handles and reads

class*, base_log_folder, gcs_log_folder, filename_template=None, gcp_key_path=None, gcp_keyfile_dict=None, gcp_scopes=_DEFAULT_SCOPESS, project_id=None)[source]

Bases: airflow.utils.log.file_task_handler.FileTaskHandler, airflow.utils.log.logging_mixin.LoggingMixin

GCSTaskHandler is a python log handler that handles and reads task instance logs. It extends airflow FileTaskHandler and uploads to and reads from GCS remote storage. Upon log reading failure, it reads from host machine's local disk.

  • base_log_folder (str) -- Base log folder to place logs.

  • gcs_log_folder (str) -- Path to a remote location where logs will be saved. It must have the prefix gs://. For example: gs://bucket/remote/log/location

  • filename_template (Optional[str]) -- template filename string

  • gcp_key_path (Optional[str]) -- Path to Google Cloud Service Account file (JSON). Mutually exclusive with gcp_keyfile_dict. If omitted, authorization based on the Application Default Credentials will be used.

  • gcp_keyfile_dict (Optional[dict]) -- Dictionary of keyfile parameters. Mutually exclusive with gcp_key_path.

  • gcp_scopes (Optional[Collection[str]]) -- Comma-separated string containing OAuth2 scopes

  • project_id (Optional[str]) -- Project ID to read the secrets from. If not passed, the project ID from credentials will be used.


Returns GCS Client.

set_context(self, ti)[source]

Provide task_instance context to airflow task handler.


ti -- task instance object


Close and upload local log file to remote storage GCS.

gcs_write(self, log, remote_log_location)[source]

Writes the log to the remote_log_location. Fails silently if no log was created.

  • log -- the log to write to the remote_log_location

  • remote_log_location -- the log's location in remote storage

Was this entry helpful?