Execute the latest version of spark on HDP.
Use additional python packages with pyspark
Make spark jobs scale reliably using iteration
Get spark and Hive to play nice again on HDP 3.1
Use idempotency of RDD's to your advantage
Bring hexagons as efficient spatial operations to spark
recent history of data processing.
Display user friendly names for cached table in Spark web UI
Preventing data skew issues for Arrays.
Combine the strengths from geomesa and geospark for ultimate geoprocessing capabilities on spark