tests.system.providers.google.cloud.dataflow.example_dataflow_native_java

Example Airflow DAG for testing Google Dataflow Beam Pipeline Operator with Java.

Important Note:

This test downloads Java JAR file from the public bucket. In case the JAR file cannot be downloaded or is not compatible with the Java version used in the test, the source code for this test can be downloaded from here (https://beam.apache.org/get-started/wordcount-example) and needs to be compiled manually in order to work.

You can follow the instructions on how to pack a self-executing jar here: https://beam.apache.org/documentation/runners/dataflow/

Requirements:

These operators require the gcloud command and Java’s JRE to run.

Module Contents

tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.ENV_ID[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.DAG_ID = 'dataflow_native_java'[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.BUCKET_NAME[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.PUBLIC_BUCKET = 'airflow-system-tests-resources'[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.JAR_FILE_NAME = 'word-count-beam-bundled-0.1.jar'[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.REMOTE_JAR_FILE_PATH[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.GCS_OUTPUT[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.GCS_JAR[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.LOCATION = 'europe-west3'[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.create_bucket[source]
tests.system.providers.google.cloud.dataflow.example_dataflow_native_java.test_run[source]

Was this entry helpful?