Posts

Arrow 2.0.0 - structs in pandas
Finally, nested types in Arrow.
Sparkling SCD2
Data preparation using Spark without ACID tables
Interesting links about Bayesian modeling
Useful links
Run the latest version of Spark
Execute the latest version of Spark on HDP.
Run the latest version of Spark
Interesting links about IoT
Useful links
Production-grade PySpark jobs
Use additional Python packages with PySpark