Orchestrating data in the mesh of the fragmented modern data stack

Apr 27, 2022·
Dr. Georg Heiler
Dr. Georg Heiler
· 1 min read
Abstract
Orchestrating data pipelines with data
Date
Apr 27, 2022 12:18 AM — 12:20 AM
events

The fragmented modern data stack has emerged as the unbundling of Airflow. Various tools operate in silos. Dagster as a next-generation data orchestrator allows you to clearly see the data dependencies of the individual pipelines on your data factory floor. Following along with my blog post series about Dagster I will cover:

  1. Getting started with dagster and building simple data pipelines
  2. How software-defined assets allow to turn data pipelines around and result in higher quality by allowing to integrate data quality tests straight into the pipelines as well as by separating business logic from infrastructure allowing for better testability.

URLS for reference:

Dr. Georg Heiler
Authors
senior data expert
Georg is a Senior data expert at Magenta and a ML-ops engineer at ASCII. He is solving challenges with data. His interests include geospatial graphs and time series. Georg transitions the data platform of Magenta to the cloud and is handling large scale multi-modal ML-ops challenges at ASCII.