@Evolving public interface SupportsReportPartitioning extends Scan
Scan
. Data sources can implement this interface to
report data partitioning and try to avoid shuffle at Spark side.
Note that, when a Scan
implementation creates exactly one InputPartition
,
Spark may avoid adding a shuffle even if the reader does not implement this interface.
Modifier and Type | Method and Description |
---|---|
Partitioning |
outputPartitioning()
Returns the output data partitioning that this reader guarantees.
|
description, readSchema, reportDriverMetrics, supportedCustomMetrics, toBatch, toContinuousStream, toMicroBatchStream
Partitioning outputPartitioning()