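---
title: "Common Spark Transformations"
source: https://www.jemoka.com/posts/kbhcommon_spark_transformations/
---

- map(func): apply a function to every element
- filter(func): keep only the elements for which the function returns true
- flatMap(func): apply a function that returns a list to every element, then flatten the returned lists into one giant list
- union(rdd): create a union of multiple RDDs (duplicates are kept)
- subtract(rdd): keep the elements of this RDD that do not appear in the other RDD
- cartesian(rdd): cartesian product of this RDD with another RDD, as pairs
- parallelize(list): make an RDD from a local list (called on the SparkContext rather than on an RDD)

## Special transformations for Pair RDDs

- reduceByKey(func): merge the values that share a key using func
- groupByKey(): collect the values that share a key into one group per key
- sortByKey(): sort the pairs by key

See also Database “Join”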
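To make the list above concrete, here is a minimal plain-Python sketch of what each transformation computes. It does not use Spark: ordinary lists stand in for RDDs, and the names `nums`, `other`, and `kv` are made up for illustration. The comments name the corresponding RDD call. Real RDDs are distributed and generally do not guarantee element order, so treat this only as a model of the semantics.

```python
from collections import defaultdict
from functools import reduce
from itertools import chain, product

# Lists stand in for RDDs (assumption: small, local, ordered data).
nums = [1, 2, 3, 4]   # models sc.parallelize([1, 2, 3, 4])
other = [3, 4, 5]     # models a second RDD

# map(func): apply a function to every element
mapped = [x * 10 for x in nums]

# filter(func): keep elements for which the function is true
evens = [x for x in nums if x % 2 == 0]

# flatMap(func): func returns a list per element; results are flattened
flat = list(chain.from_iterable([x, x] for x in nums))

# union(rdd): concatenate both collections, duplicates kept
unioned = nums + other

# subtract(rdd): elements of nums that do not appear in other
subtracted = [x for x in nums if x not in other]

# cartesian(rdd): every (left, right) pair
pairs = list(product(nums[:2], other[:2]))

# Pair RDDs: collections of (key, value) tuples
kv = [("a", 1), ("b", 2), ("a", 3)]

# groupByKey(): gather the values that share a key
groups = defaultdict(list)
for key, value in kv:
    groups[key].append(value)
grouped = dict(groups)

# reduceByKey(func): fold the values of each key with func (here: addition)
reduced = {key: reduce(lambda a, b: a + b, values) for key, values in grouped.items()}

# sortByKey(): order the pairs by key
sorted_kv = sorted(kv, key=lambda pair: pair[0])
```

In Spark itself, `reduceByKey` is usually preferred over `groupByKey` followed by a reduction, because it can combine values within each partition before data is shuffled.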