Manju’s Blog

Manju S V

Sr. Associate Technology L2 at Publicis Sapient. M.Tech Computer Science from NITK Surathkal. Interested in Python, Data Science, Machine Learning, Django and Flask

Broadcast Join in Spark

3 minute read

What is a Broadcast Join in Spark? A Broadcast Join in Spark is an optimized join strategy where one of the datasets is broadcasted (shared) to all the nodes...

reduceByKey() vs groupByKey() in Spark

3 minute read

What are reduceByKey() and groupByKey() in Spark? reduceByKey() Definition: Combines values of the same key using a specified reduce function (like sum, m...

Transformations and Actions in Spark

1 minute read

In Apache Spark, two key operations work together to process data: Transformations and Actions. Understanding these concepts helps us efficiently work with l...

Narrow and Wide Transformations in Spark

1 minute read

In Apache Spark, Transformations are divided into two types: Narrow and Wide Transformations. Understanding these helps optimize Spark jobs for performance a...

Jobs, Stages and Tasks in Spark

3 minute read

What is a Job? A Job is a complete Spark operation triggered by an action like collect(), count(), or save().

Manju S V

Recent posts

Broadcast Join in Spark

reduceByKey() vs groupByKey() in Spark

Transformations and Actions in Spark

Narrow and Wide Transformations in Spark

Jobs, Stages and Tasks in Spark