20 minute impulse talk about open data and priciples of handling data. See https://docs.google.com/presentation/d/1W7MiXO-6qrYADONrIiJdOtY_zFPB9QSghJ_JjMiVaAU/edit?slide=id.g33e86fc9cec_0_15#slide=id.g33e86fc9cec_0_15 for the slides. https://www.data.gv.at/2025/03/19/open-data-hackathon-im-metalab-wien-datenschaetze-entdecken-und-nutzbar-machen/ for the event some interesting learnings high value open data sets https://www.data.gv.at/en/2023/01/26/innovation-potential-through-public-data-eu-commission-obliges-member-states-to-release-high-value-data-sets/ https://www.data.gv.at/katalog/dataset/e91bd464-be86-453c-b693-2ab818e11df2 https://www.data.gv.at/wp-content/uploads/2023/01/Auflistung-HVDs-Details_datagvat_26012023.pdf https://justizonline.gv.at/jop/web/iwg https://www.offenerhaushalt.at/ data how-to https://georgheiler.com/post/learning-data-engineering/ https://github.com/l-mds/local-data-stack
Apr 21, 2025
A comprehensive guide to modern data engineering with local-first development practices
Mar 14, 2025
Pixi is a tool which enables efficient dependency handling. It is created from prefix.dev, built in Rust and very fast. For us at Magenta Telekom in Austria Pixi is beneficial as we build our new data platform around metadata and strong governance and an explicit graph of data dependencies. In this talk we share our experience with Pixi and how it empowers our data infrastructure - in conjunction with Dagster.
Jan 31, 2025
Jumpstart your data processing with this local modern data stack template
Oct 25, 2024
Spark-based data PaaS solutions are convenient. But they come with their own set of challenges such as a high vendor lock-in and obscured costs. We show how to use a dedicated orchestrator ([dagster-pipes](https://docs.dagster.io/guides/dagster-pipes)). It can not only make Databricks an implementation detail but also save cost. Also, it improves developer productivity. It allows you to take back control.
Sep 12, 2024
Spark-based data PaaS solutions are convenient. But they come with their own set of challenges such as a high vendor lock-in and obscured costs. We show how to use a dedicated orchestrator ([dagster-pipes](https://docs.dagster.io/guides/dagster-pipes)). It can not only make Databricks an implementation detail but also save cost. Also, it improves developer productivity. It allows you to take back control.
Jun 21, 2024
Save money 💰 and increase developer productivity 👩💻👨💻 by limiting scope-creep of Spark-based data PaaS solutions: 🌐 turn them into an implementation detail 🔧.
Jun 21, 2024
Lean and efficient MDS experience: Delivers better software engineering practices to the data ecosystem with the new local MDS stack comprised of Dagster, dbt and DuckDB which offers better developer productivity by enhancing testability of the E2E pipeline.
Dec 11, 2023