Class BarrierTaskContext
- All Implemented Interfaces:
- Serializable,- org.apache.spark.internal.Logging
TaskContext with extra contextual info and tooling for tasks in a barrier stage.
 Use get() to obtain the barrier context for a running barrier task.- See Also:
- 
Nested Class SummaryNested classes/interfaces inherited from interface org.apache.spark.internal.Loggingorg.apache.spark.internal.Logging.LogStringContext, org.apache.spark.internal.Logging.SparkShellLoggingFilter
- 
Method SummaryModifier and TypeMethodDescriptionAdds a (Java friendly) listener to be executed on task completion.addTaskFailureListener(TaskFailureListener listener) Adds a listener to be executed on task failure (which includes completion listener failure, if the task body did not already fail).String[]:: Experimental :: Blocks until all tasks in the same stage have reached this routine.intHow many times this task has been attempted.voidbarrier():: Experimental :: Sets a global barrier and waits until all tasks in this stage hit this barrier.intcpus()CPUs allocated to the task.static BarrierTaskContextget():: Experimental :: Returns the currently active BarrierTaskContext.getLocalProperty(String key) Get a local property set upstream in the driver, or null if it is missing.scala.collection.immutable.Seq<Source>getMetricsSources(String sourceName) ::DeveloperApi:: Returns all metrics sources with the given name which are associated with the instance which runs the task.:: Experimental :: ReturnsBarrierTaskInfofor all tasks in this barrier stage, ordered by partition ID.booleanReturns true if the task has completed.booleanisFailed()Returns true if the task has failed.booleanReturns true if the task has been killed.intTotal number of partitions in the stage that this task belongs to.intThe ID of the RDD partition that is computed by this task.scala.collection.immutable.Map<String,ResourceInformation> Resources allocated to the task.(java-specific) Resources allocated to the task.intHow many times the stage that this task belongs to has been attempted.intstageId()The ID of the stage that this task belong to.longAn ID that is unique to this task attempt (within the same SparkContext, no two task attempts will share the same attempt ID).org.apache.spark.executor.TaskMetricsMethods inherited from class org.apache.spark.TaskContextaddTaskCompletionListener, addTaskFailureListener, getPartitionIdMethods inherited from class java.lang.Objectequals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.apache.spark.internal.LogginginitializeForcefully, initializeLogIfNecessary, initializeLogIfNecessary, initializeLogIfNecessary$default$2, isTraceEnabled, log, logBasedOnLevel, logDebug, logDebug, logDebug, logDebug, logError, logError, logError, logError, logInfo, logInfo, logInfo, logInfo, logName, LogStringContext, logTrace, logTrace, logTrace, logTrace, logWarning, logWarning, logWarning, logWarning, org$apache$spark$internal$Logging$$log_, org$apache$spark$internal$Logging$$log__$eq, withLogContext
- 
Method Details- 
get:: Experimental :: Returns the currently active BarrierTaskContext. This can be called inside of user functions to access contextual information about running barrier tasks.- Returns:
- (undocumented)
 
- 
barrierpublic void barrier():: Experimental :: Sets a global barrier and waits until all tasks in this stage hit this barrier. Similar to MPI_Barrier function in MPI, the barrier() function call blocks until all tasks in the same stage have reached this routine.CAUTION! In a barrier stage, each task must have the same number of barrier() calls, in all possible code branches. Otherwise, you may get the job hanging or a SparkException after timeout. Some examples of '''misuses''' are listed below: 1. Only call barrier() function on a subset of all the tasks in the same barrier stage, it shall lead to timeout of the function call. rdd.barrier().mapPartitions { iter => val context = BarrierTaskContext.get() if (context.partitionId() == 0) { // Do nothing. } else { context.barrier() } iter }2. Include barrier() function in a try-catch code block, this may lead to timeout of the second function call. rdd.barrier().mapPartitions { iter => val context = BarrierTaskContext.get() try { // Do something that might throw an Exception. doSomething() context.barrier() } catch { case e: Exception => logWarning("...", e) } context.barrier() iter }
- 
allGather:: Experimental :: Blocks until all tasks in the same stage have reached this routine. Each task passes in a message and returns with a list of all the messages passed in by each of those tasks.CAUTION! The allGather method requires the same precautions as the barrier method The message is type String rather than Array[Byte] because it is more convenient for the user at the cost of worse performance. - Parameters:
- message- (undocumented)
- Returns:
- (undocumented)
 
- 
getTaskInfos:: Experimental :: ReturnsBarrierTaskInfofor all tasks in this barrier stage, ordered by partition ID.- Returns:
- (undocumented)
 
- 
isCompletedpublic boolean isCompleted()Description copied from class:TaskContextReturns true if the task has completed.- Specified by:
- isCompletedin class- TaskContext
- Returns:
- (undocumented)
 
- 
isFailedpublic boolean isFailed()Description copied from class:TaskContextReturns true if the task has failed.- Specified by:
- isFailedin class- TaskContext
- Returns:
- (undocumented)
 
- 
isInterruptedpublic boolean isInterrupted()Description copied from class:TaskContextReturns true if the task has been killed.- Specified by:
- isInterruptedin class- TaskContext
- Returns:
- (undocumented)
 
- 
addTaskCompletionListenerDescription copied from class:TaskContextAdds a (Java friendly) listener to be executed on task completion. This will be called in all situations - success, failure, or cancellation. Adding a listener to an already completed task will result in that listener being called immediately.Two listeners registered in the same thread will be invoked in reverse order of registration if the task completes after both are registered. There are no ordering guarantees for listeners registered in different threads, or for listeners registered after the task completes. Listeners are guaranteed to execute sequentially. An example use is for HadoopRDD to register a callback to close the input stream. Exceptions thrown by the listener will result in failure of the task. - Specified by:
- addTaskCompletionListenerin class- TaskContext
- Parameters:
- listener- (undocumented)
- Returns:
- (undocumented)
 
- 
addTaskFailureListenerDescription copied from class:TaskContextAdds a listener to be executed on task failure (which includes completion listener failure, if the task body did not already fail). Adding a listener to an already failed task will result in that listener being called immediately.Note: Prior to Spark 3.4.0, failure listeners were only invoked if the main task body failed. - Specified by:
- addTaskFailureListenerin class- TaskContext
- Parameters:
- listener- (undocumented)
- Returns:
- (undocumented)
 
- 
stageIdpublic int stageId()Description copied from class:TaskContextThe ID of the stage that this task belong to.- Specified by:
- stageIdin class- TaskContext
- Returns:
- (undocumented)
 
- 
stageAttemptNumberpublic int stageAttemptNumber()Description copied from class:TaskContextHow many times the stage that this task belongs to has been attempted. The first stage attempt will be assigned stageAttemptNumber = 0, and subsequent attempts will have increasing attempt numbers.- Specified by:
- stageAttemptNumberin class- TaskContext
- Returns:
- (undocumented)
 
- 
partitionIdpublic int partitionId()Description copied from class:TaskContextThe ID of the RDD partition that is computed by this task.- Specified by:
- partitionIdin class- TaskContext
- Returns:
- (undocumented)
 
- 
numPartitionspublic int numPartitions()Description copied from class:TaskContextTotal number of partitions in the stage that this task belongs to.- Specified by:
- numPartitionsin class- TaskContext
- Returns:
- (undocumented)
 
- 
attemptNumberpublic int attemptNumber()Description copied from class:TaskContextHow many times this task has been attempted. The first task attempt will be assigned attemptNumber = 0, and subsequent attempts will have increasing attempt numbers.- Specified by:
- attemptNumberin class- TaskContext
- Returns:
- (undocumented)
 
- 
taskAttemptIdpublic long taskAttemptId()Description copied from class:TaskContextAn ID that is unique to this task attempt (within the same SparkContext, no two task attempts will share the same attempt ID). This is roughly equivalent to Hadoop's TaskAttemptID.- Specified by:
- taskAttemptIdin class- TaskContext
- Returns:
- (undocumented)
 
- 
getLocalPropertyDescription copied from class:TaskContextGet a local property set upstream in the driver, or null if it is missing. See alsoorg.apache.spark.SparkContext.setLocalProperty.- Specified by:
- getLocalPropertyin class- TaskContext
- Parameters:
- key- (undocumented)
- Returns:
- (undocumented)
 
- 
taskMetricspublic org.apache.spark.executor.TaskMetrics taskMetrics()- Specified by:
- taskMetricsin class- TaskContext
 
- 
getMetricsSourcesDescription copied from class:TaskContext::DeveloperApi:: Returns all metrics sources with the given name which are associated with the instance which runs the task. For more information seeorg.apache.spark.metrics.MetricsSystem.- Specified by:
- getMetricsSourcesin class- TaskContext
- Parameters:
- sourceName- (undocumented)
- Returns:
- (undocumented)
 
- 
cpuspublic int cpus()Description copied from class:TaskContextCPUs allocated to the task.- Specified by:
- cpusin class- TaskContext
- Returns:
- (undocumented)
 
- 
resourcesDescription copied from class:TaskContextResources allocated to the task. The key is the resource name and the value is information about the resource. Please refer toResourceInformationfor specifics.- Specified by:
- resourcesin class- TaskContext
- Returns:
- (undocumented)
 
- 
resourcesJMapDescription copied from class:TaskContext(java-specific) Resources allocated to the task. The key is the resource name and the value is information about the resource. Please refer toResourceInformationfor specifics.- Specified by:
- resourcesJMapin class- TaskContext
- Returns:
- (undocumented)
 
 
-