tests.system.google.cloud.dataflow.example_dataflow_native_java

Example Airflow DAG for testing Google Dataflow Beam Pipeline Operator with Java.

Important Note:

This test downloads Java JAR file from the public bucket. In case the JAR file cannot be downloaded or is not compatible with the Java version used in the test, the source code for this test can be downloaded from here (https://beam.apache.org/get-started/wordcount-example) and needs to be compiled manually in order to work.

You can follow the instructions on how to pack a self-executing jar here: https://beam.apache.org/documentation/runners/dataflow/

Requirements:

These operators require the gcloud command and Java’s JRE to run.

Attributes

ENV_ID

DAG_ID

BUCKET_NAME

PUBLIC_BUCKET

JAR_FILE_NAME

GCS_JAR

GCS_OUTPUT

LOCATION

create_bucket

test_run

Module Contents

tests.system.google.cloud.dataflow.example_dataflow_native_java.ENV_ID[source]
tests.system.google.cloud.dataflow.example_dataflow_native_java.DAG_ID = 'dataflow_native_java'[source]
tests.system.google.cloud.dataflow.example_dataflow_native_java.BUCKET_NAME[source]
tests.system.google.cloud.dataflow.example_dataflow_native_java.PUBLIC_BUCKET = 'airflow-system-tests-resources'[source]
tests.system.google.cloud.dataflow.example_dataflow_native_java.JAR_FILE_NAME = 'word-count-beam-bundled-0.1.jar'[source]
tests.system.google.cloud.dataflow.example_dataflow_native_java.GCS_JAR = 'gs://airflow-system-tests-resources/dataflow/java/word-count-beam-bundled-0.1.jar'[source]
tests.system.google.cloud.dataflow.example_dataflow_native_java.GCS_OUTPUT[source]
tests.system.google.cloud.dataflow.example_dataflow_native_java.LOCATION = 'europe-west3'[source]
tests.system.google.cloud.dataflow.example_dataflow_native_java.create_bucket[source]
tests.system.google.cloud.dataflow.example_dataflow_native_java.test_run[source]

Was this entry helpful?