L-Mds

Open Data Hackathon Wien 25

A 20-minute impulse talk about open data and principles of handling data. See https://docs.google.com/presentation/d/1W7MiXO-6qrYADONrIiJdOtY_zFPB9QSghJ_JjMiVaAU/edit?slide=id.g33e86fc9cec_0_15#slide=id.g33e86fc9cec_0_15 for the slides and https://www.data.gv.at/2025/03/19/open-data-hackathon-im-metalab-wien-datenschaetze-entdecken-und-nutzbar-machen/ for the event.

Some interesting learnings

High-value open data sets:

- https://www.data.gv.at/en/2023/01/26/innovation-potential-through-public-data-eu-commission-obliges-member-states-to-release-high-value-data-sets/
- https://www.data.gv.at/katalog/dataset/e91bd464-be86-453c-b693-2ab818e11df2
- https://www.data.gv.at/wp-content/uploads/2023/01/Auflistung-HVDs-Details_datagvat_26012023.pdf
- https://justizonline.gv.at/jop/web/iwg
- https://www.offenerhaushalt.at/

Data how-to:

- https://georgheiler.com/post/learning-data-engineering/
- https://github.com/l-mds/local-data-stack

Apr 21, 2025

Upskilling data engineers

A comprehensive guide to modern data engineering with local-first development practices

Mar 14, 2025

Pixi powering Telekom data cloud

Pixi is a tool for efficient dependency handling. It is created by prefix.dev, built in Rust, and very fast. At Magenta Telekom in Austria, Pixi is beneficial as we build our new data platform around metadata, strong governance, and an explicit graph of data dependencies. In this talk we share our experience with Pixi and how it empowers our data infrastructure in conjunction with Dagster.

Jan 31, 2025

Local data stack template

Jumpstart your data processing with this local modern data stack template

Oct 25, 2024

Cost efficient alternative to databricks lock-in

Spark-based data PaaS solutions are convenient, but they come with their own set of challenges, such as high vendor lock-in and obscured costs. We show how to use a dedicated orchestrator (Dagster with [dagster-pipes](https://docs.dagster.io/guides/dagster-pipes)) that not only makes Databricks an implementation detail but also saves cost and improves developer productivity. It allows you to take back control.

Sep 12, 2024

Cloud arbitrage for spark pipelines

Spark-based data PaaS solutions are convenient, but they come with their own set of challenges, such as high vendor lock-in and obscured costs. We show how to use a dedicated orchestrator (Dagster with [dagster-pipes](https://docs.dagster.io/guides/dagster-pipes)) that not only makes Databricks an implementation detail but also saves cost and improves developer productivity. It allows you to take back control.

Jun 21, 2024

Cost efficient alternative to databricks lock-in

Save money 💰 and increase developer productivity 👩‍💻👨‍💻 by limiting the scope creep of Spark-based data PaaS solutions: 🌐 turn them into an implementation detail 🔧.

Jun 21, 2024

Dagster, dbt, duckdb as new local MDS

A lean and efficient MDS experience: the new local MDS stack of Dagster, dbt, and DuckDB brings better software engineering practices to the data ecosystem and improves developer productivity by making the end-to-end (E2E) pipeline easier to test.

Dec 11, 2023