apache-airflow-providers-apache-hdfs

Package apache-airflow-providers-apache-hdfs

Hadoop Distributed File System (HDFS) and WebHDFS.

Release: 4.0.0

Provider package

This is a provider package for apache.hdfs provider. All classes for this provider package are in airflow.providers.apache.hdfs python package.

Installation

You can install this package on top of an existing Airflow 2 installation (see Requirements below) for the minimum Airflow version supported) via pip install apache-airflow-providers-apache-hdfs

Requirements

The minimum Apache Airflow version supported by this provider package is 2.4.0.

PIP package

Version required

apache-airflow

>=2.4.0

hdfs[avro,dataframe,kerberos]

>=2.0.4

Changelog

4.0.0

Note

This release of provider is only available for Airflow 2.4+ as explained in the Apache Airflow providers support policy.

Breaking changes

The original HDFS Hook and sensor has been removed. It used the old HDFS snakebite-py3 library that had no update in years and the protobuf they are using reached end of life.

The 3.* version of the provider is still available and can be used if you need to use the old hooks and sensors.

The HDFSHook, HDFSSensor, HdfsRegexSensor, HdfsRegexSensor that have been removed from this provider and they are not available any more. If you want to continue using them, you can use 3.* version of the provider, but the recommendation is to switch to the new WebHDFSHook and WebHDFSSensor that use the WebHDFS API.

  • Remove snakebite-py3 based HDFS hooks and sensors (#31262)

Misc

  • Bump minimum Airflow version in providers (#30917)

3.2.1

Bug Fixes

  • Fix HDFSHook HAClient is invalid (#30164)

3.2.0

Note

This release of provider is only available for Airflow 2.3+ as explained in the Apache Airflow providers support policy.

Misc

  • Move min airflow version to 2.3.0 for all providers (#27196)

3.1.0

Features

  • Adding Authentication to webhdfs sensor  (#25110)

3.0.1

Bug Fixes

  • 'WebHDFSHook' Bugfix/optional port (#24550)

3.0.0

Breaking changes

Misc

  • chore: Refactoring and Cleaning Apache Providers (#24219)

2.2.3

Bug Fixes

  • Fix mistakenly added install_requires for all providers (#22382)

2.2.2

Misc

  • Add Trove classifiers in PyPI (Framework :: Apache Airflow :: Provider)

2.2.1

Misc

  • Support for Python 3.10

  • Add how-to guide for WebHDFS operators (#21393)

2.2.0

Features

  • hdfs provider: restore HA support for webhdfs (#19711)

2.1.1

Bug Fixes

  • fix get_connections deprecation warning in webhdfs hook (#18331)

2.1.0

Features

  • hdfs provider: allow SSL webhdfs connections (#17637)

Misc

  • Optimise connection importing for Airflow 2.2.0

2.0.0

Breaking changes

  • Auto-apply apply_default decorator (#15667)

Warning

Due to apply_default decorator removal, this version of the provider requires Airflow 2.1.0+. If your Airflow version is < 2.1.0, and you want to install this provider version, first upgrade Airflow to at least version 2.1.0. Otherwise your Airflow package version will be upgraded automatically and you will have to manually run airflow upgrade db to complete the migration.

1.0.1

Updated documentation and readme files.

1.0.0

Initial version of the provider.

Was this entry helpful?