StateStoreAwareZipPartitionsRDD

StateStoreAwareZipPartitionsRDD is a ZippedPartitionsRDD2 that zips the partitions of the left and right parent RDDs.
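
To make the zipping of two parent RDDs concrete, the following is a minimal sketch that uses the public RDD.zipPartitions operator, which creates a plain ZippedPartitionsRDD2 (not a StateStoreAwareZipPartitionsRDD). The object name ZipPartitionsSketch, the helper zipParents, the sample data and the local[2] master are assumptions made for illustration only.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD

// Illustrative only: the names below are hypothetical and not part of Spark.
object ZipPartitionsSketch {

  // Pairs up the elements of co-indexed partitions of the two parent RDDs.
  // Both parents must have the same number of partitions.
  def zipParents(left: RDD[Int], right: RDD[String]): RDD[String] =
    left.zipPartitions(right) { (leftIter, rightIter) =>
      leftIter.zip(rightIter).map { case (n, s) => s"$n$s" }
    }

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setMaster("local[2]").setAppName("zip-partitions-sketch")
    val sc = new SparkContext(conf)
    try {
      val left = sc.parallelize(Seq(1, 2, 3, 4), numSlices = 2)
      val right = sc.parallelize(Seq("a", "b", "c", "d"), numSlices = 2)
      val zipped = zipParents(left, right)
      // The lineage shows the zipped RDD on top of the two parents.
      println(zipped.toDebugString)
      println(zipped.collect().mkString(", "))
    } finally {
      sc.stop()
    }
  }
}
```

The function passed to zipPartitions receives one iterator per parent for every partition index, which is the same shape of computation that a ZippedPartitionsRDD2 subclass inherits.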

StateStoreAwareZipPartitionsRDD is created exclusively when the StreamingSymmetricHashJoinExec physical operator is requested to execute (and requests StateStoreAwareZipPartitionsHelper for one).

Creating StateStoreAwareZipPartitionsRDD Instance

StateStoreAwareZipPartitionsRDD takes the following to be created:

Placement Preferences of Partition — getPreferredLocations Method

getPreferredLocations(partition: Partition): Seq[String]

Note: getPreferredLocations is a part of the RDD Contract to specify placement preferences (aka preferred task locations), i.e. where tasks should be executed to be as close to the data as possible.
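
As a generic illustration of this part of the RDD Contract (and not of what StateStoreAwareZipPartitionsRDD itself does), here is a minimal sketch of a custom RDD that overrides getPreferredLocations. HostPartition, HostAwareRDD and the host names are hypothetical and exist only for this example. The preferences can be read back with the public RDD.preferredLocations method.

```scala
import org.apache.spark.{Partition, SparkConf, SparkContext, TaskContext}
import org.apache.spark.rdd.RDD

// Hypothetical partition that remembers the host it would like to run on.
class HostPartition(val index: Int, val host: String) extends Partition

// Hypothetical RDD with one partition per host and no parent RDDs.
class HostAwareRDD(sc: SparkContext, hosts: Seq[String]) extends RDD[String](sc, Nil) {

  override protected def getPartitions: Array[Partition] =
    Array.tabulate[Partition](hosts.length)(i => new HostPartition(i, hosts(i)))

  override def compute(split: Partition, context: TaskContext): Iterator[String] =
    Iterator(s"partition ${split.index}")

  // Placement preference: the scheduler tries to run the task for this
  // partition on the given host, but is free to pick another one.
  override protected def getPreferredLocations(split: Partition): Seq[String] =
    Seq(split.asInstanceOf[HostPartition].host)
}

object PreferredLocationsSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setMaster("local[2]").setAppName("preferred-locations-sketch")
    val sc = new SparkContext(conf)
    try {
      val rdd = new HostAwareRDD(sc, Seq("host-a", "host-b"))
      rdd.partitions.foreach { p =>
        println(s"partition ${p.index} -> ${rdd.preferredLocations(p).mkString(", ")}")
      }
    } finally {
      sc.stop()
    }
  }
}
```

Placement preferences are hints only: when no executor runs on a preferred host, the task is still scheduled elsewhere.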

getPreferredLocations…​FIXME