Data links KW 20
May 17, 2019·
·
1 min read
Dr. Georg Heiler
- Apache Spark Data Validation
- Submit jobs to spark in parallel
- Architecting Structured Streaming Pipelines the right way
- Understanding query plans and Spark Uis. Great tips on SQL tuning
- Modular Apache Spark Transform Your Code in Pieces
- Parition handling in spark and handoop as well as small files problem and possible solutions
- Apache Kafka Data Access Semantics: Consumers and Membership
- Persitable HyperLogLog in spark using swoop-inc/spark-alchemy

Authors
senior data expert
Georg is a Senior data expert at Magenta and a ML-ops engineer at ASCII.
He is solving challenges with data. His interests include geospatial graphs
and time series. Georg transitions the data platform of Magenta to the cloud
and is handling large scale multi-modal ML-ops challenges at ASCII.