Posts

Turning the data pipeline inside out

Turning the data pipeline inside out

Flip the data pipeline to get a better notion about the things we actually care about: data assets not transformations.

Georg Heiler

• Mar 4, 2022

From hello-world to simple pipelines

From hello-world to simple pipelines

Getting started with simple dagster pipelines.

Georg Heiler

• Mar 4, 2022

Modern data orchestration using Dagster

Modern data orchestration using Dagster

Overview over the modern data stack ecosystem. Introduction to this blog series

Georg Heiler

• Mar 4, 2022

Interactive dagster debugging

Interactive dagster debugging

Interacting with a running dagster instance interactively

Georg Heiler

• Feb 2, 2022

Scalable sparse matrix multiplication

Scalable sparse matrix multiplication

Using Apache Spark for **sparse** matrix multiplication

Georg Heiler

• Aug 6, 2021

COVID population model

COVID population model

WWTF COVID project summary

Georg Heiler

• May 12, 2021

ML project configuration management

Easy configuration handling for complex machine learning pipelines

Georg Heiler

• May 8, 2021

Can you tell the nuts & berries apart in each group?

Can you tell the nuts & berries apart in each group?

Guaranteed anonymity in high-dimensional data using differential privacy

Georg Heiler

• Mar 8, 2021

Intersting links about deep learning

Georg Heiler

• Dec 3, 2020

Exact percentiles in Spark

Combining the power of Scala and Python to make the calculation of percentiles in Spark easy and fast

Georg Heiler

• Nov 21, 2020