pyspark.RDD.collectWithJobGroup

RDD.collectWithJobGroup(groupId: str, description: str, interruptOnCancel: bool = False) → List[T]

Collect the elements of this RDD, running the collect job under the specified job group.

New in version 3.0.0.

Deprecated since version 3.1.0: Use pyspark.InheritableThread with the pinned thread mode enabled.
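The replacement named in the deprecation note can be sketched as follows. This is a minimal illustration, not the library's own migration recipe: the helper name collect_in_job_group and its parameters are hypothetical, and it assumes pinned thread mode is enabled (controlled by the PYSPARK_PIN_THREAD environment variable, which has to be set before the SparkContext is created). It uses SparkContext.setJobGroup inside a pyspark.InheritableThread and then calls the ordinary RDD.collect.

```python
def collect_in_job_group(sc, rdd, group_id, description, interrupt_on_cancel=False):
    """Collect `rdd` from a child thread whose jobs belong to `group_id`.

    Hypothetical helper: `sc` is an active SparkContext, `rdd` an RDD created from it.
    """
    # Imported lazily so that defining this helper does not itself require Spark.
    from pyspark import InheritableThread

    outcome = {}

    def run():
        try:
            # Assumption: pinned thread mode is on, so this Python thread has its
            # own JVM thread and the job group set here applies to collect() below.
            sc.setJobGroup(group_id, description, interruptOnCancel=interrupt_on_cancel)
            outcome["result"] = rdd.collect()
        except Exception as exc:
            # Keep the failure so it can be re-raised in the calling thread.
            outcome["error"] = exc

    thread = InheritableThread(target=run)
    thread.start()
    thread.join()

    if "error" in outcome:
        raise outcome["error"]
    return outcome["result"]
```

With an active SparkContext sc, a call such as collect_in_job_group(sc, sc.parallelize(range(4)), "demo-group", "demo collect") would return the collected elements. Running the work in a separate thread keeps the job group from leaking into jobs submitted by the calling thread.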

Parameters
groupId : str

The group ID to assign.

description : str

The description to set for the job group.

interruptOnCancel : bool, optional, default False

Whether to interrupt running jobs when the job group is cancelled.

Returns
list

A list containing all the elements of this RDD.
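As an illustration of how the parameters fit together, the sketch below runs the collect in a worker thread and cancels the job group if it does not finish in time. It assumes a Spark release where this deprecated method is still available. The helper name collect_or_cancel and the timeout_s parameter are hypothetical, and because the exception raised on cancellation is not specified here, it is caught broadly.

```python
import threading


def collect_or_cancel(sc, rdd, group_id, description, timeout_s):
    """Collect `rdd` under `group_id`; cancel the group after `timeout_s` seconds.

    Hypothetical helper. Returns {"result": [...]} on success, or
    {"error": exc} if the collect failed or was cancelled.
    """
    outcome = {}

    def run():
        try:
            # Argument order follows the signature: groupId, description,
            # interruptOnCancel. True asks Spark to interrupt running tasks.
            outcome["result"] = rdd.collectWithJobGroup(
                group_id, description, interruptOnCancel=True
            )
        except Exception as exc:
            outcome["error"] = exc

    worker = threading.Thread(target=run)
    worker.start()
    worker.join(timeout_s)

    if worker.is_alive():
        # Still running after the timeout: cancel every job in the group,
        # then wait for the worker to observe the cancellation.
        sc.cancelJobGroup(group_id)
        worker.join()

    return outcome
```

The group ID passed to collectWithJobGroup is the same one later handed to SparkContext.cancelJobGroup, which is what lets the caller target only this collect rather than every job on the context.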