Templates reference¶
Variables, macros and filters can be used in templates (see the Jinja Templating section)
The following come for free out of the box with Airflow.
Additional custom macros can be added globally through Plugins, or at a DAG level through the
DAG.user_defined_macros
argument.
Variables¶
The Airflow engine passes a few variables by default that are accessible in all templates
Variable |
Description |
---|---|
|
Start of the data interval (pendulum.DateTime). |
|
End of the data interval (pendulum.DateTime). |
|
The DAG run’s logical date as |
|
Same as |
|
Same as |
|
Same as |
|
Same as |
|
Start of the data interval from prior successful DAG run
(pendulum.DateTime or |
|
End of the data interval from prior successful DAG run
(pendulum.DateTime or |
|
Start date from prior successful dag run (if available)
(pendulum.DateTime or |
|
The DAG object. |
|
The Task object. |
|
A reference to the macros package, described below. |
|
The task_instance object. |
|
Same as |
|
A reference to the user-defined params dictionary which can be
overridden by the dictionary passed through |
|
Global defined variables represented as a dictionary. |
|
Global defined variables represented as a dictionary. With deserialized JSON object, append the path to the key within the JSON object. |
|
Connection represented as a dictionary. |
|
A unique, human-readable key to the task instance formatted
|
|
The full configuration object located at
|
|
The |
|
A reference to the DagRun object. |
|
Whether the task instance was called using the CLI’s test subcommand. |
Note
The DAG run’s logical date, and values derived from it, such as ds
and
ts
, should not be considered unique in a DAG. Use run_id
instead.
The following variables are deprecated. They are kept for backward compatibility, but you should convert existing code to use other variables instead.
Deprecated Variable |
Description |
---|---|
|
the execution date (logical date), same as |
|
the next execution date (if available) (pendulum.DateTime)
if |
|
the next execution date as |
|
the next execution date as |
|
the previous execution date (if available) (pendulum.DateTime)
if |
|
the previous execution date as |
|
the previous execution date as |
|
the day before the execution date as |
|
the day before the execution date as |
|
the day after the execution date as |
|
the day after the execution date as |
|
execution date from prior successful dag run |
Note that you can access the object’s attributes and methods with simple
dot notation. Here are some examples of what is possible:
{{ task.owner }}
, {{ task.task_id }}
, {{ ti.hostname }}
, …
Refer to the models documentation for more information on the objects’
attributes and methods.
The var
template variable allows you to access variables defined in Airflow’s
UI. You can access them as either plain-text or JSON. If you use JSON, you are
also able to walk nested structures, such as dictionaries like:
{{ var.json.my_dict_var.key1 }}
.
It is also possible to fetch a variable by string if needed with
{{ var.value.get('my.var', 'fallback') }}
or
{{ var.json.get('my.dict.var', {'key1': 'val1'}) }}
. Defaults can be
supplied in case the variable does not exist.
Similarly, Airflow Connections data can be accessed via the conn
template variable.
For example, you could use expressions in your templates like {{ conn.my_conn_id.login }}
,
{{ conn.my_conn_id.password }}
, etc.
Just like with var
it’s possible to fetch a connection by string (e.g. {{ conn.get('my_conn_id_'+index).host }}
) or provide defaults (e.g {{ conn.get('my_conn_id', {"host": "host1", "login": "user1"}).host }}
)
Filters¶
Airflow defines some Jinja filters that can be used to format values.
For example, using {{ execution_date | ds }}
will output the execution_date in the YYYY-MM-DD
format.
Filter |
Operates on |
Description |
---|---|---|
|
datetime |
Format the datetime as |
|
datetime |
Format the datetime as |
|
datetime |
Same as |
|
datetime |
Same as |
|
datetime |
As |
Macros¶
Macros are a way to expose objects to your templates and live under the
macros
namespace in your templates.
A few commonly used libraries and methods are made available.
Variable |
Description |
---|---|
|
The standard lib’s |
|
The standard lib’s |
|
A reference to the |
|
The standard lib’s |
|
The standard lib’s |
|
The standard lib’s |
Some airflow specific macros are also defined:
-
airflow.macros.
datetime_diff_for_humans
(dt: Any, since: Optional[datetime.datetime] = None) → str[source]¶ Return a human-readable/approximate difference between two datetimes, or one and now.
- Parameters
dt (datetime.datetime) – The datetime to display the diff for
since (None or datetime.datetime) – When to display the date from. If
None
then the diff is betweendt
and now.
- Return type
-
airflow.macros.
ds_add
(ds: str, days: int) → str[source]¶ Add or subtract days from a YYYY-MM-DD
- Parameters
>>> ds_add('2015-01-01', 5) '2015-01-06' >>> ds_add('2015-01-06', -5) '2015-01-01'
-
airflow.macros.
ds_format
(ds: str, input_format: str, output_format: str) → str[source]¶ Takes an input string and outputs another string as specified in the output format
- Parameters
>>> ds_format('2015-01-01', "%Y-%m-%d", "%m-%d-%y") '01-01-15' >>> ds_format('1/5/2015', "%m/%d/%Y", "%Y-%m-%d") '2015-01-05'
-
airflow.macros.
random
() → x in the interval [0, 1).¶
-
airflow.macros.hive.
closest_ds_partition
(table, ds, before=True, schema='default', metastore_conn_id='metastore_default')[source]¶ This function finds the date in a list closest to the target date. An optional parameter can be given to get the closest before or after.
- Parameters
- Returns
The closest date
- Return type
>>> tbl = 'airflow.static_babynames_partitioned' >>> closest_ds_partition(tbl, '2015-01-02') '2015-01-01'
-
airflow.macros.hive.
max_partition
(table, schema='default', field=None, filter_map=None, metastore_conn_id='metastore_default')[source]¶ Gets the max partition for a table.
- Parameters
schema (str) – The hive schema the table lives in
table (str) – The hive table you are interested in, supports the dot notation as in “my_database.my_table”, if a dot is found, the schema param is disregarded
metastore_conn_id (str) – The hive connection you are interested in. If your default is set you don’t need to use this parameter.
filter_map (dict) – partition_key:partition_value map used for partition filtering, e.g. {‘key1’: ‘value1’, ‘key2’: ‘value2’}. Only partitions matching all partition_key:partition_value pairs will be considered as candidates of max partition.
field (str) – the field to get the max value from. If there’s only one partition field, this will be inferred
>>> max_partition('airflow.static_babynames_partitioned') '2015-01-01'