Amazon S3 Glacier to GCS

Amazon Glacier is a secure, durable, and extremely low-cost Amazon S3 cloud storage class for data archiving and long-term backup.

Prerequisite Tasks

To use these operators, you must do a few things:

Operators

Amazon S3 Glacier To GCS transfer operator

To transfer data from an Amazon Glacier vault to Google Cloud Storage you can use GlacierToGCSOperator

tests/system/providers/amazon/aws/example_glacier_to_gcs.py[source]

transfer_archive_to_gcs = GlacierToGCSOperator(
    task_id="transfer_archive_to_gcs",
    vault_name=vault_name,
    bucket_name=gcs_bucket_name,
    object_name=gcs_object_name,
    gzip=False,
    # Override to match your needs
    # If chunk size is bigger than actual file size
    # then whole file will be downloaded
    chunk_size=1024,
)

Note

Please be aware that GlacierToGCSOperator depends on available memory. Transferring large files may exhaust memory on the worker host.

Was this entry helpful?