trait Batch extends AnyRef

A physical representation of a data source scan for batch queries. This interface provides physical information, such as how many partitions the scanned data has and how to read records from those partitions.

Annotations
@Evolving()
Source
Batch.java
Since
3.0.0

Linear Supertypes
AnyRef, Any

Abstract Value Members

  1. abstract def createReaderFactory(): PartitionReaderFactory

    Returns a factory to create a PartitionReader for each InputPartition.

  2. abstract def planInputPartitions(): Array[InputPartition]

    Returns a list of input partitions. Each InputPartition represents a data split that can be processed by one Spark task. The number of input partitions returned here is the same as the number of RDD partitions this scan outputs.

    If the Scan supports filter pushdown, this Batch is likely configured with a filter and is responsible for creating splits for that filter rather than for a full scan.

    This method will be called only once during a data source scan, to launch one Spark job.
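The two abstract members above can be illustrated with a minimal sketch of a Batch over an integer range. The names `RangePartition`, `RangeReaderFactory`, and `RangeBatch` are hypothetical and exist only for this example; the splitting strategy is an assumption, not something the interface prescribes. It assumes Spark 3.x is on the classpath.

```scala
import org.apache.spark.sql.catalyst.InternalRow
import org.apache.spark.sql.connector.read.{Batch, InputPartition, PartitionReader, PartitionReaderFactory}

// Hypothetical split: the half-open integer range [start, end).
// InputPartition is Serializable; a case class satisfies that.
case class RangePartition(start: Int, end: Int) extends InputPartition

// Hypothetical factory: creates one reader per split.
class RangeReaderFactory extends PartitionReaderFactory {
  override def createReader(partition: InputPartition): PartitionReader[InternalRow] =
    partition match {
      case RangePartition(start, end) =>
        new PartitionReader[InternalRow] {
          private var current = start - 1
          // Advance to the next value; false once the split is exhausted.
          override def next(): Boolean = { current += 1; current < end }
          // Single-column row holding the current integer.
          override def get(): InternalRow = InternalRow(current)
          override def close(): Unit = ()
        }
      case other =>
        throw new IllegalArgumentException(s"Unexpected partition: $other")
    }
}

// Hypothetical Batch over [start, end), cut into at most numSplits splits.
// A Scan with filter pushdown could pass bounds already narrowed by the filter.
class RangeBatch(start: Int, end: Int, numSplits: Int) extends Batch {
  require(numSplits > 0, "numSplits must be positive")

  override def planInputPartitions(): Array[InputPartition] = {
    val total = math.max(end - start, 0)
    val step = math.max(1, (total + numSplits - 1) / numSplits) // ceiling division
    (start until end by step)
      .map(s => RangePartition(s, math.min(s + step, end)): InputPartition)
      .toArray
  }

  override def createReaderFactory(): PartitionReaderFactory = new RangeReaderFactory
}
```

In this sketch, `new RangeBatch(0, 10, 3).planInputPartitions()` would plan three splits, [0, 4), [4, 8), and [8, 10), each of which can be processed by one Spark task.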