pyspark.streaming.DStream.groupByKey

DStream.groupByKey(numPartitions: Optional[int] = None) → pyspark.streaming.dstream.DStream[Tuple[K, Iterable[V]]]

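Return a new DStream by applying groupByKey to each RDD of this DStream. Each RDD of (key, value) pairs becomes an RDD of (key, iterable of values) pairs with numPartitions partitions. If numPartitions is None, the SparkContext's default parallelism is used.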
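
The per-RDD behavior can be illustrated without a Spark cluster. The following is a minimal pure-Python sketch of the semantics only, not Spark code: `group_batch_by_key` and `batches` are hypothetical names, and each inner list stands in for one RDD (micro-batch) of the stream. It shows that values are grouped within a batch and never across batches.

```python
from collections import defaultdict
from typing import Hashable, Iterable, List, Tuple

Pair = Tuple[Hashable, int]


def group_batch_by_key(batch: Iterable[Pair]) -> List[Tuple[Hashable, List[int]]]:
    """Group the values of a single batch by key (hypothetical helper).

    This models what groupByKey does to one RDD. Real Spark does not
    guarantee the order of keys or of values within a group; this sketch
    happens to keep first-seen order because dicts preserve insertion order.
    """
    grouped = defaultdict(list)
    for key, value in batch:
        grouped[key].append(value)
    return list(grouped.items())


# Two micro-batches standing in for consecutive RDDs of a DStream.
batches = [
    [("a", 1), ("b", 2), ("a", 3)],
    [("a", 4), ("c", 5)],
]

# Grouping is applied to each batch independently.
grouped_batches = [group_batch_by_key(batch) for batch in batches]

for grouped in grouped_batches:
    print(grouped)
```

The key "a" appears in both batches but is grouped separately in each one. With a real DStream of (key, value) pairs, the corresponding call is `stream.groupByKey()` or `stream.groupByKey(numPartitions)`, where `stream` is an assumed variable name.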