apache-spark

Arrow 2.0.0 - structs in pandas
Finally, nested types in Arrow.
Sparkling SCD2
Data preparation using Spark without ACID tables
Run the latest version of spark
Execute the latest version of Spark on HDP.
Production grade pyspark jobs
Use additional Python packages with PySpark