apache-airflow-providers-apache-hdfs

Package apache-airflow-providers-apache-hdfs

Hadoop Distributed File System (HDFS) and WebHDFS.

Note

The snakebite-py3 used by the provider is an old package and it has an old version of argparse in its dependencies and it might cause in some cases running airflow commands raises the error similar to TypeError: __init__() got an unexpected keyword argument 'encoding'. In this case make sure to remove argparse with pip uninstall argparse command to get rid of this error.

Release: 3.2.0

Provider package

This is a provider package for apache.hdfs provider. All classes for this provider package are in airflow.providers.apache.hdfs python package.

Installation

You can install this package on top of an existing Airflow 2 installation (see Requirements below) for the minimum Airflow version supported) via pip install apache-airflow-providers-apache-hdfs

Requirements

PIP package

Version required

apache-airflow

>=2.3.0

snakebite-py3

hdfs[avro,dataframe,kerberos]

>=2.0.4

Changelog

3.2.0

This release of provider is only available for Airflow 2.3+ as explained in the Apache Airflow providers support policy.

Misc

  • Move min airflow version to 2.3.0 for all providers (#27196)

3.1.0

Features

  • Adding Authentication to webhdfs sensor  (#25110)

3.0.1

Bug Fixes

  • 'WebHDFSHook' Bugfix/optional port (#24550)

3.0.0

Breaking changes

Misc

  • chore: Refactoring and Cleaning Apache Providers (#24219)

2.2.3

Bug Fixes

  • Fix mistakenly added install_requires for all providers (#22382)

2.2.2

Misc

  • Add Trove classifiers in PyPI (Framework :: Apache Airflow :: Provider)

2.2.1

Misc

  • Support for Python 3.10

  • Add how-to guide for WebHDFS operators (#21393)

2.2.0

Features

  • hdfs provider: restore HA support for webhdfs (#19711)

2.1.1

Bug Fixes

  • fix get_connections deprecation warning in webhdfs hook (#18331)

2.1.0

Features

  • hdfs provider: allow SSL webhdfs connections (#17637)

Misc

  • Optimise connection importing for Airflow 2.2.0

2.0.0

Breaking changes

  • Auto-apply apply_default decorator (#15667)

Warning

Due to apply_default decorator removal, this version of the provider requires Airflow 2.1.0+. If your Airflow version is < 2.1.0, and you want to install this provider version, first upgrade Airflow to at least version 2.1.0. Otherwise your Airflow package version will be upgraded automatically and you will have to manually run airflow upgrade db to complete the migration.

1.0.1

Updated documentation and readme files.

1.0.0

Initial version of the provider.

Was this entry helpful?