Databricks

apache-airflow-providers-databricks

Databricks

Works with Airflow 2.11+
Install:
pip install apache-airflow-providers-databricks==7.11.0

Airflow

2.11+

Python

>=3.10

Dependencies (11)

Show all Hide apache-airflow>=2.11.0 apache-airflow-providers-common-compat>=1.13.0 apache-airflow-providers-common-sql>=1.32.0 requests>=2.32.0,<3 databricks-sql-connector>=4.0.0 aiohttp>=3.9.2, <4 mergedeep>=1.3.4 pandas>=2.1.2; python_version <"3.13" pandas>=2.2.3; python_version >="3.13" pyarrow>=16.1.0; python_version < '3.13' pyarrow>=18.0.0; python_version >= '3.13'

Connections (1)

Modules

O

DatabricksCreateJobsOperator

Creates (or resets) a Databricks job using the API endpoint.

airflow.providers.databricks.operators.databricks.DatabricksCreateJobsOperator
O

DatabricksNotebookOperator

Runs a notebook on Databricks using an Airflow operator.

airflow.providers.databricks.operators.databricks.DatabricksNotebookOperator
O

DatabricksRunNowOperator

Runs an existing Spark job run to Databricks using the api/2.2/jobs/run-now API endpoint.

airflow.providers.databricks.operators.databricks.DatabricksRunNowOperator
O

DatabricksSQLStatementsOperator

Submits a Databricks SQL Statement to Databricks using the api/2.0/sql/statements/ API endpoint.

airflow.providers.databricks.operators.databricks.DatabricksSQLStatementsOperator
O

DatabricksSubmitRunOperator

Submits a Spark job run to Databricks using the api/2.2/jobs/runs/submit API endpoint.

airflow.providers.databricks.operators.databricks.DatabricksSubmitRunOperator
O

DatabricksTaskBaseOperator

Base class for operators that are run as Databricks job tasks or tasks within a Databricks workflow.

airflow.providers.databricks.operators.databricks.DatabricksTaskBaseOperator
O

DatabricksTaskOperator

Runs a task on Databricks using an Airflow operator.

airflow.providers.databricks.operators.databricks.DatabricksTaskOperator
O

DatabricksCopyIntoOperator

Executes COPY INTO command in a Databricks SQL endpoint or a Databricks cluster.

airflow.providers.databricks.operators.databricks_sql.DatabricksCopyIntoOperator
O

DatabricksSqlOperator

Executes SQL code in a Databricks SQL endpoint or a Databricks cluster.

airflow.providers.databricks.operators.databricks_sql.DatabricksSqlOperator
O

DatabricksReposCreateOperator

Creates, and optionally checks out, a Databricks Repo using the POST api/2.0/repos API endpoint.

airflow.providers.databricks.operators.databricks_repos.DatabricksReposCreateOperator
O

DatabricksReposDeleteOperator

Deletes specified repository using the DELETE api/2.0/repos API endpoint.

airflow.providers.databricks.operators.databricks_repos.DatabricksReposDeleteOperator
O

DatabricksReposUpdateOperator

Updates specified repository to a given branch or tag using the PATCH api/2.0/repos API endpoint.

airflow.providers.databricks.operators.databricks_repos.DatabricksReposUpdateOperator
H

DatabricksHook

Interact with Databricks.

airflow.providers.databricks.hooks.databricks.DatabricksHook
H

DatabricksSqlHook

Hook to interact with Databricks SQL.

airflow.providers.databricks.hooks.databricks_sql.DatabricksSqlHook
S

DatabricksSQLStatementsSensor

DatabricksSQLStatementsSensor.

airflow.providers.databricks.sensors.databricks.DatabricksSQLStatementsSensor
S

DatabricksSqlSensor

Sensor that runs a SQL query on Databricks.

airflow.providers.databricks.sensors.databricks_sql.DatabricksSqlSensor
S

DatabricksPartitionSensor

Sensor to detect the presence of table partitions in Databricks.

airflow.providers.databricks.sensors.databricks_partition.DatabricksPartitionSensor
T

DatabricksExecutionTrigger

The trigger handles the logic of async communication with DataBricks API.

airflow.providers.databricks.triggers.databricks.DatabricksExecutionTrigger
T

DatabricksSQLStatementExecutionTrigger

The trigger handles the logic of async communication with DataBricks SQL Statements API.

airflow.providers.databricks.triggers.databricks.DatabricksSQLStatementExecutionTrigger