pyspark.streaming.DStream.leftOuterJoin¶

DStream.leftOuterJoin(other: pyspark.streaming.dstream.DStream[Tuple[K, U]], numPartitions: Optional[int] = None) → pyspark.streaming.dstream.DStream[Tuple[K, Tuple[V, Optional[U]]]][source]¶

Return a new DStream by applying ‘left outer join’ between RDDs of this DStream and other DStream.

Hash partitioning is used to generate the RDDs with numPartitions partitions.

pyspark.streaming.DStream.join

pyspark.streaming.DStream.map