Orchestration with Airflow

Make ML pipelines a breeze with Airflow and Anyscale

An Airflow DAG using the Ray Provider

Building a flow of tasks that depend on each other is a critical piece of infrastructure that most ML or Data Platforms within an organization will need.

In Airflow parlance, the recommended integration point is the Airflow-Ray Provider. See below for a diagram for how this fits together alongside other operators, tasks, and your storage layer:

Airflow gets a super-charge when paired with Ray's zero-copy plasma object store

Writing a DAG that uses Ray as the Provider layer looks like this:

See a full example on GitHub that you can pull down and try out here.

Last updated

Was this helpful?