airflow.providers.apache.spark.operators.spark_jdbc

Classes

SparkJDBCOperator

Extend the SparkSubmitOperator to perform data transfers to/from JDBC-based databases with Apache Spark.

Module Contents

class airflow.providers.apache.spark.operators.spark_jdbc.SparkJDBCOperator(*, spark_app_name='airflow-spark-jdbc', spark_conn_id='spark-default', spark_conf=None, spark_py_files=None, spark_files=None, spark_jars=None, cmd_type='spark_to_jdbc', jdbc_table=None, jdbc_conn_id='jdbc-default', jdbc_driver=None, metastore_table=None, jdbc_truncate=False, save_mode=None, save_format=None, batch_size=None, fetch_size=None, num_partitions=None, partition_column=None, lower_bound=None, upper_bound=None, create_table_column_types=None, **kwargs)

Bases: airflow.providers.apache.spark.operators.spark_submit.SparkSubmitOperator

Extend the SparkSubmitOperator to perform data transfers to/from JDBC-based databases with Apache Spark.

As with the SparkSubmitOperator, it assumes that the “spark-submit” binary is available on the PATH.
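
A minimal sketch of how the operator might be configured for the default spark_to_jdbc direction. The keyword names come from the signature above. The table names, driver class, and jar path are hypothetical placeholders, and task_id is the usual Airflow operator argument passed through **kwargs. The helper spark_submit_on_path is not part of the provider; it only illustrates the PATH assumption.

```python
import shutil

# Keyword names follow the SparkJDBCOperator signature above.
# Every value marked "hypothetical" is a placeholder, not a recommendation.
transfer_kwargs = {
    "task_id": "spark_to_jdbc_example",        # standard Airflow argument, via **kwargs
    "cmd_type": "spark_to_jdbc",               # the default shown in the signature
    "spark_conn_id": "spark-default",          # default from the signature
    "jdbc_conn_id": "jdbc-default",            # default from the signature
    "jdbc_table": "public.orders",             # hypothetical JDBC table
    "metastore_table": "staging.orders",       # hypothetical metastore table
    "jdbc_driver": "org.postgresql.Driver",    # hypothetical driver choice
    "spark_jars": "/opt/jars/postgresql.jar",  # hypothetical jar path
    "jdbc_truncate": False,
    "batch_size": 1000,                        # hypothetical batch size
}


def spark_submit_on_path() -> bool:
    """Return True if a spark-submit binary can be found on the PATH."""
    return shutil.which("spark-submit") is not None


def build_transfer_task():
    """Create the operator; requires the Apache Spark provider to be installed."""
    # Imported lazily so the rest of this sketch runs without Airflow.
    from airflow.providers.apache.spark.operators.spark_jdbc import (
        SparkJDBCOperator,
    )

    return SparkJDBCOperator(**transfer_kwargs)
```

In a real DAG file, build_transfer_task() would typically be called inside a DAG context so that the task is attached to the DAG. Calling spark_submit_on_path() on the worker beforehand is an optional sanity check, not something the operator does itself.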