Amazon S3 Glacier Operators

Amazon Glacier is a secure, durable, and extremely low-cost Amazon S3 cloud storage class for data archiving and long-term backup.

Prerequisite Tasks

To use these operators, you must do a few things:

Amazon Glacier Create Job Operator

To initiate an Amazon Glacier inventory retrieval job, use GlacierCreateJobOperator.

This operator returns a dictionary of information about the initiated job, including jobId, which subsequent tasks require.

airflow/providers/amazon/aws/example_dags/example_glacier_to_gcs.py

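from airflow.providers.amazon.aws.operators.glacier import GlacierCreateJobOperator
create_glacier_job = GlacierCreateJobOperator(task_id="create_glacier_job", vault_name=VAULT_NAME)
# Templated value: resolved at runtime from the dictionary the task above pushes to XCom.
JOB_ID = '{{ task_instance.xcom_pull("create_glacier_job")["jobId"] }}'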
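
As a rough illustration of where jobId comes from, the sketch below assumes the job is started with a boto3-style Glacier client call, initiate_job, using the inventory-retrieval job type. The helper start_inventory_job, the StubGlacierClient class, and the vault name "my-vault" are hypothetical names introduced only for this example; the stub stands in for boto3.client("glacier") so the code runs without AWS credentials. It is not the operator's actual implementation.

```python
from typing import Any, Dict


def start_inventory_job(client: Any, vault_name: str) -> Dict[str, Any]:
    """Start an inventory retrieval job and return the service response.

    Assumption: `client` behaves like boto3.client("glacier").
    """
    return client.initiate_job(
        vaultName=vault_name,
        jobParameters={"Type": "inventory-retrieval"},
    )


class StubGlacierClient:
    """Hypothetical stand-in for a Glacier client (no network access)."""

    def initiate_job(self, vaultName: str, jobParameters: Dict[str, str]) -> Dict[str, Any]:
        # Only the fields used in this example are modeled.
        return {
            "location": f"/-/vaults/{vaultName}/jobs/example-job-id",
            "jobId": "example-job-id",
        }


response = start_inventory_job(StubGlacierClient(), "my-vault")
# This is the value that downstream tasks read via the templated JOB_ID.
job_id = response["jobId"]
```

In a DAG, the returned dictionary would be pushed to XCom by the operator, which is why the template above can index it with ["jobId"].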

Amazon Glacier Job Sensor

To wait for an Amazon Glacier job to reach a terminal state, use GlacierJobOperationSensor.

airflow/providers/amazon/aws/example_dags/example_glacier_to_gcs.py

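from airflow.providers.amazon.aws.sensors.glacier import GlacierJobOperationSensor

# Block downstream tasks until the job identified by JOB_ID reaches a terminal state.
wait_for_operation_complete = GlacierJobOperationSensor(
    vault_name=VAULT_NAME,
    job_id=JOB_ID,
    task_id="wait_for_operation_complete",
)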
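
To make "waiting for a terminal state" concrete, here is a minimal polling sketch. It assumes a boto3-style describe_job call whose response carries a StatusCode of InProgress, Succeeded, or Failed. The function wait_for_job, its poke_interval and max_pokes parameters, and the StubGlacierClient class are hypothetical and exist only for this illustration; the sensor's real scheduling and error handling are managed by Airflow and may differ.

```python
import time
from typing import Any, Dict, List


def wait_for_job(
    client: Any,
    vault_name: str,
    job_id: str,
    poke_interval: float = 60.0,
    max_pokes: int = 10,
) -> Dict[str, Any]:
    """Poll the job until it succeeds; raise if it fails or never finishes.

    Assumption: `client` behaves like boto3.client("glacier").
    """
    for _ in range(max_pokes):
        response = client.describe_job(vaultName=vault_name, jobId=job_id)
        status = response["StatusCode"]
        if status == "Succeeded":
            return response
        if status != "InProgress":
            # Any other status (for example "Failed") is terminal and unsuccessful.
            raise RuntimeError(f"Glacier job {job_id} ended with status {status}")
        time.sleep(poke_interval)
    raise TimeoutError(f"Glacier job {job_id} still in progress after {max_pokes} checks")


class StubGlacierClient:
    """Hypothetical stand-in that replays a fixed sequence of statuses."""

    def __init__(self, statuses: List[str]) -> None:
        self._statuses = list(statuses)
        self.calls = 0

    def describe_job(self, vaultName: str, jobId: str) -> Dict[str, Any]:
        self.calls += 1
        return {"JobId": jobId, "StatusCode": self._statuses.pop(0)}


client = StubGlacierClient(["InProgress", "InProgress", "Succeeded"])
# poke_interval=0 keeps the demonstration instant; real jobs can take hours.
result = wait_for_job(client, "my-vault", "example-job-id", poke_interval=0)
```

Using the sensor rather than a hand-written loop like this lets Airflow handle the wait as a task with its own retries and timeouts.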
