Python

Orchestrating data in the mesh of the fragmented modern data stack

The fragmented modern data stack has emerged as the unbundling of Airflow. Various tools operate in silos. Dagster as a next-generation data orchestrator allows you to clearly see …

avatar
Dr. Georg Heiler
Making BigData small again (and green) featured image

Making BigData small again (and green)

Towards simpler and perhaps more energy efficient data platforms with increased developer productivity.

avatar
Dr. Georg Heiler
Comparing SQL-based streaming approaches featured image

Comparing SQL-based streaming approaches

Comparing established and up-and-coming streaming approaches for an integrated real-time data model

avatar
Dr. Georg Heiler
Identifying the root cause of cable network problems with machine learning featured image

Identifying the root cause of cable network problems with machine learning

Good quality network connectivity is ever more important. For hybrid fiber coaxial (HFC) networks, searching for upstream high noise in the past was cumbersome and time-consuming. …

avatar
Dr. Georg Heiler
SFTP sensor featured image

SFTP sensor

Way too many data pipelines still work with SFTP file transfer. Even a modern data orchestrator needs to interface here well.

avatar
Dr. Georg Heiler
Connector goodness from Airbyte E2E lineage featured image

Connector goodness from Airbyte E2E lineage

Simplify data ingestion with the plentiful connectors of Airbyte without compromising on data lineage

avatar
Dr. Georg Heiler
Scalable data pipelines from dagster with pyspark featured image

Scalable data pipelines from dagster with pyspark

Getting started with simple dagster pipelines.

avatar
Dr. Georg Heiler
Tame your notebooks featured image

Tame your notebooks

Include jupyter notebooks into reliable data pipelines.

avatar
Dr. Georg Heiler
Fully-fledged example with resources featured image

Fully-fledged example with resources

A full example E2E

avatar
Dr. Georg Heiler
Turning the data pipeline inside out featured image

Turning the data pipeline inside out

Flip the data pipeline to get a better notion about the things we actually care about: data assets not transformations.

avatar
Dr. Georg Heiler