Spark Operator

Posted on 2021-09-29, updated on 2021-12-15, filed under Study, Spark

🌈RDD Operators

An overview of Spark RDD operators:

++                    first                    max                  take
aggregate             flatMap                  min                  takeAsync
barrier               fold                     name                 takeOrdered
cache                 foreach                  partitioner          takeSample
canEqual              foreachAsync             partitions           toDF
cartesian             foreachPartition         persist              toDS
checkpoint            foreachPartitionAsync    pipe                 toDebugString
coalesce              getCheckpointFile        preferredLocations   toJavaRDD
collect               getNumPartitions         productArity         toLocalIterator
collectAsync          getStorageLevel          productElement       toString
compute               glom                     productIterator      top
context               groupBy                  productPrefix        treeAggregate
copy                  id                       randomSplit          treeReduce
count                 intersection             reduce               union
countApprox           isCheckpointed           repartition          unpersist
countApproxDistinct   isEmpty                  sample               zip
countAsync            iterator                 saveAsObjectFile     zipPartitions
countByValue          keyBy                    saveAsTextFile       zipWithIndex
countByValueApprox    localCheckpoint          setName              zipWithUniqueId
dependencies          map                      sortBy
distinct              mapPartitions            sparkContext
filter                mapPartitionsWithIndex   subtract
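
To make the list a little more concrete, here is a minimal sketch that exercises a few of the operators above. It assumes Spark is on the classpath and runs in local mode; the object name `RddOperatorSketch`, the `run` helper, the app name, and the sample data are all hypothetical and chosen only for illustration. `filter` and `map` are lazy transformations, while `collect`, `reduce`, and `take` are actions that trigger the actual computation.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical demo object, not part of Spark itself.
object RddOperatorSketch {

  // Runs a few operators from the list and returns their results.
  def run(): (Array[Int], Int, Array[Int], Int) = {
    // Assumption: a throwaway local session; app name and master are placeholders.
    val spark = SparkSession.builder()
      .appName("rdd-operator-sketch")
      .master("local[*]")
      .getOrCreate()
    val sc = spark.sparkContext

    try {
      // Sample data: the integers 1..10 spread over 2 partitions.
      val nums = sc.parallelize(1 to 10, 2)

      // Transformations: lazy, they only describe a new RDD.
      val evens   = nums.filter(_ % 2 == 0)
      val squares = evens.map(n => n * n)

      // Actions: these trigger the computation and return values to the driver.
      val collected  = squares.collect()      // 4, 16, 36, 64, 100
      val total      = squares.reduce(_ + _)  // 220
      val firstThree = nums.take(3)           // 1, 2, 3
      val partitions = nums.getNumPartitions  // 2

      (collected, total, firstThree, partitions)
    } finally {
      spark.stop()
    }
  }

  def main(args: Array[String]): Unit = {
    val (collected, total, firstThree, partitions) = run()
    println(collected.mkString(", "))
    println(total)
    println(firstThree.mkString(", "))
    println(partitions)
  }
}
```

In `spark-shell` the session and `sc` already exist, so only the lines between `parallelize` and `getNumPartitions` would be needed there.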
