Cache vs Persist in Spark
Managed vs External Table in Spark
Repartition vs Coalesce in Spark
Caching an RDD in Spark
Broadcast Join in Spark
reduceByKey() vs groupByKey() in Spark
Transformations and Actions in Spark
Narrow and Wide Transformations in Spark
Jobs, Stages and Tasks in Spark