Spark Operator

🌈RDD算子

Spark RDD算子一览:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
++                    first                    max                  take
aggregate flatMap min takeAsync
barrier fold name takeOrdered
cache foreach partitioner takeSample
canEqual foreachAsync partitions toDF
cartesian foreachPartition persist toDS
checkpoint foreachPartitionAsync pipe toDebugString
coalesce getCheckpointFile preferredLocations toJavaRDD
collect getNumPartitions productArity toLocalIterator
collectAsync getStorageLevel productElement toString
compute glom productIterator top
context groupBy productPrefix treeAggregate
copy id randomSplit treeReduce
count intersection reduce union
countApprox isCheckpointed repartition unpersist
countApproxDistinct isEmpty sample zip
countAsync iterator saveAsObjectFile zipPartitions
countByValue keyBy saveAsTextFile zipWithIndex
countByValueApprox localCheckpoint setName zipWithUniqueId
dependencies map sortBy
distinct mapPartitions sparkContext
filter mapPartitionsWithIndex subtract

-------------本文结束感谢您的阅读-------------