Georg Heiler
Georg Heiler
Home
Blog
Publications
Projects
Lecturing
Talks
Contact
Light
Dark
Automatic
Posts
Cost efficient alternative to databricks lock-in
Save money 💰 and increase developer productivity 👩💻👨💻 by limiting scope-creep of Spark-based data PaaS solutions: 🌐 turn them into an implementation detail 🔧.
Georg Heiler
,
Hernan Picatto
Jun 21, 2024
22 min read
Dagster, dbt, duckdb as new local MDS
Lean and efficient MDS experience: Delivers better software engineering practices to the data ecosystem with the new local MDS stack comprised of Dagster, dbt and DuckDB which offers better developer productivity by enhancing testability of the E2E pipeline.
Aleksandar Milicevic
,
Georg Heiler
Dec 11, 2023
19 min read
Securing Secrets with Mozilla SopS and AGE: A Powerful Combo
🔐 Exploring the power duo of Mozilla’s sops & AGE for secret management! Dive deep into their benefits: simplicity, version control compatibility & robust encryption. Secure your data the modern way! 💻🛡️ #ITSecurity #Encryption #DataProtection
Georg Heiler
Dec 1, 2023
4 min read
Unlocking Advanced Metadata Extraction with the New DBT API in Dagster
📊 Unleash the power of metadata extraction in your data engineering pipelines with the new DBT API in Dagster! 🚀 Learn how to seamlessly integrate and leverage DBT transformations, while enriching your data catalog with advanced metadata. Elevate your data governance and collaboration to new heights!
Georg Heiler
Jun 13, 2023
4 min read
Making BigData small again (and green)
Towards simpler and perhaps more energy efficient data platforms with increased developer productivity.
Georg Heiler
Apr 2, 2022
9 min read
Comparing SQL-based streaming approaches
Comparing established and up-and-coming streaming approaches for an integrated real-time data model
Georg Heiler
Apr 1, 2022
28 min read
SFTP sensor
Way too many data pipelines still work with SFTP file transfer. Even a modern data orchestrator needs to interface here well.
Georg Heiler
,
Sandy Ryza
Mar 4, 2022
5 min read
Connector goodness from Airbyte E2E lineage
Simplify data ingestion with the plentiful connectors of Airbyte without compromising on data lineage
Georg Heiler
,
Sandy Ryza
Mar 4, 2022
3 min read
Scalable data pipelines from dagster with pyspark
Getting started with simple dagster pipelines.
Georg Heiler
,
Sandy Ryza
Mar 4, 2022
5 min read
Tame your notebooks
Include jupyter notebooks into reliable data pipelines.
Georg Heiler
,
Sandy Ryza
Mar 4, 2022
4 min read
»
Cite
×