StateStoreAwareZipPartitionsRDD
is created exclusively when StreamingSymmetricHashJoinExec
physical operator is requested to execute (and requests StateStoreAwareZipPartitionsHelper for one).
StateStoreAwareZipPartitionsRDD
takes the following to be created:
-
Names of the state stores
getPreferredLocations(partition: Partition): Seq[String]
Note
|
getPreferredLocations is a part of the RDD Contract to specify placement preferences (aka preferred task locations), i.e. where tasks should be executed to be as close to the data as possible.
|
getPreferredLocations
…FIXME