Content¶
Guides
References
Commits
apache-airflow-providers-apache-hdfs
¶Hadoop Distributed File System (HDFS) and WebHDFS.
Release: 4.0.0
This is a provider package for apache.hdfs
provider. All classes for this provider package
are in airflow.providers.apache.hdfs
python package.
You can install this package on top of an existing Airflow 2 installation (see Requirements
below)
for the minimum Airflow version supported) via
pip install apache-airflow-providers-apache-hdfs
The minimum Apache Airflow version supported by this provider package is 2.4.0
.
PIP package |
Version required |
---|---|
|
|
|
|
Note
This release of provider is only available for Airflow 2.4+ as explained in the Apache Airflow providers support policy.
The original HDFS Hook and sensor has been removed. It used the old HDFS snakebite-py3 library that had no update in years and the protobuf they are using reached end of life.
The 3.* version of the provider is still available and can be used if you need to use the old hooks and sensors.
The HDFSHook
, HDFSSensor
, HdfsRegexSensor
, HdfsRegexSensor
that have been removed from
this provider and they are not available any more. If you want to continue using them,
you can use 3.* version of the provider, but the recommendation is to switch to the new
WebHDFSHook
and WebHDFSSensor
that use the WebHDFS
API.
Remove snakebite-py3 based HDFS hooks and sensors (#31262)
Fix HDFSHook HAClient is invalid (#30164)
Note
This release of provider is only available for Airflow 2.3+ as explained in the Apache Airflow providers support policy.
This release of provider is only available for Airflow 2.2+ as explained in the Apache Airflow providers support policy https://github.com/apache/airflow/blob/main/README.md#support-for-providers
hdfs provider: allow SSL webhdfs connections (#17637)
Auto-apply apply_default decorator (#15667)
Warning
Due to apply_default decorator removal, this version of the provider requires Airflow 2.1.0+.
If your Airflow version is < 2.1.0, and you want to install this provider version, first upgrade
Airflow to at least version 2.1.0. Otherwise your Airflow package version will be upgraded
automatically and you will have to manually run airflow upgrade db
to complete the migration.
Updated documentation and readme files.
Initial version of the provider.