Google Cloud BigQuery Transfer Operator to Postgres¶
Google Cloud BigQuery is Google Cloud’s serverless data warehouse offering. PostgreSQL is an open-source relational database management system. This operator can be used to copy data from a BigQuery table to PostgreSQL.
Prerequisite Tasks¶
To use these operators, you must do a few things:
Select or create a Cloud Platform project using the Cloud Console.
Enable billing for your project, as described in the Google Cloud documentation.
Enable the API, as described in the Cloud Console documentation.
Install API libraries via pip.
pip install 'apache-airflow[google]'Detailed information is available for Installation.
Operator¶
Copying data from BigQuery table to Postgres table is performed with the
BigQueryToPostgresOperator
operator.
Use Jinja templating with
target_table_name
, impersonation_chain
, dataset_id
, table_id
to define values dynamically.
You may use the parameter selected_fields
to limit the fields to be copied (all fields by default),
as well as the parameter replace
to overwrite the destination table instead of appending to it.
For more information, please refer to the links above.
Transferring data¶
The following Operator copies data from a BigQuery table to PostgreSQL.
bigquery_to_postgres = BigQueryToPostgresOperator(
task_id="bigquery_to_postgres",
postgres_conn_id=CONNECTION_ID,
dataset_table=f"{BIGQUERY_DATASET_NAME}.{BIGQUERY_TABLE}",
target_table_name=SQL_TABLE,
batch_size=BATCH_SIZE,
replace=False,
)
Reference¶
For further information, look at: