@InterfaceStability.Evolving
public interface ContinuousReader extends org.apache.spark.sql.execution.streaming.BaseStreamingSource, DataSourceReader

A mix-in interface for DataSourceReader. Data source readers can implement this interface to allow reading in a continuous processing mode stream.

Implementations must ensure each reader factory output is a ContinuousDataReader.

Note: This class currently extends BaseStreamingSource to maintain compatibility with DataSource V1 APIs. This extension will be removed once we get rid of V1 completely.

Method Summary

Modifier and Type | Method and Description |
---|---|
void | `commit(Offset end)` Informs the source that Spark has completed processing all data for offsets less than or equal to `end` and will only request offsets greater than `end` in the future. |
Offset | `deserializeOffset(String json)` Deserialize a JSON string into an Offset of the implementation-defined offset type. |
Offset | `getStartOffset()` Return the specified or inferred start offset for this reader. |
Offset | `mergeOffsets(PartitionOffset[] offsets)` Merge partitioned offsets coming from ContinuousDataReader instances for each partition into a single global offset. |
default boolean | `needsReconfiguration()` The execution engine will call this method in every epoch to determine whether new reader factories need to be generated, which may be required if, for example, the underlying source system has had partitions added or removed. |
void | `setStartOffset(java.util.Optional<Offset> start)` Set the desired start offset for reader factories created from this reader. |
Methods inherited from interface DataSourceReader: createDataReaderFactories, readSchema
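The `mergeOffsets` contract above can be illustrated with a small, self-contained sketch. This is plain Java with no Spark dependency; `PartitionPos` and `GlobalOffset` are hypothetical stand-ins for the real `PartitionOffset` and `Offset` types, used only to show the per-partition-to-global merge:

```java
import java.util.HashMap;
import java.util.Map;

public class MergeOffsetsSketch {
    // Hypothetical stand-in for PartitionOffset: one reader's position
    // within a single partition.
    static final class PartitionPos {
        final int partition;
        final long position;
        PartitionPos(int partition, long position) {
            this.partition = partition;
            this.position = position;
        }
    }

    // Hypothetical stand-in for a global Offset: the positions of all
    // partitions at one epoch boundary.
    static final class GlobalOffset {
        final Map<Integer, Long> positions;
        GlobalOffset(Map<Integer, Long> positions) { this.positions = positions; }
    }

    // Mirrors the mergeOffsets contract: collect the offset reported by
    // each partition's reader into a single global offset.
    static GlobalOffset mergeOffsets(PartitionPos[] offsets) {
        Map<Integer, Long> merged = new HashMap<>();
        for (PartitionPos p : offsets) {
            merged.put(p.partition, p.position);
        }
        return new GlobalOffset(merged);
    }

    public static void main(String[] args) {
        PartitionPos[] perPartition = {
            new PartitionPos(0, 42L), new PartitionPos(1, 17L)
        };
        GlobalOffset global = mergeOffsets(perPartition);
        System.out.println(global.positions);
    }
}
```

A real implementation would merge into its own source-specific offset type (for example, a map of topic-partition to position for a Kafka-like source) rather than a bare map.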
Method Detail

Offset mergeOffsets(PartitionOffset[] offsets)

Merge partitioned offsets coming from ContinuousDataReader instances for each partition into a single global offset.

Offset deserializeOffset(String json)

Deserialize a JSON string into an Offset of the implementation-defined offset type.

Throws: IllegalArgumentException - if the JSON does not encode a valid offset for this reader

void setStartOffset(java.util.Optional<Offset> start)

Set the desired start offset for reader factories created from this reader.

Offset getStartOffset()

Return the specified or inferred start offset for this reader.

Throws: IllegalStateException - if setStartOffset has not been called

default boolean needsReconfiguration()

The execution engine will call this method in every epoch to determine whether new reader factories need to be generated, which may be required if, for example, the underlying source system has had partitions added or removed.

void commit(Offset end)

Informs the source that Spark has completed processing all data for offsets less than or equal to `end` and will only request offsets greater than `end` in the future.
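Since the offset type is implementation-defined, the `deserializeOffset` round-trip is worth a sketch. The toy `{"position":N}` encoding and the `SimpleOffset` class below are hypothetical (a real source would use a JSON library and its own offset fields); the sketch only shows the parse-or-reject contract, including the `IllegalArgumentException` on invalid input:

```java
public class OffsetJsonSketch {
    // Hypothetical implementation-defined offset type that serializes
    // itself to the toy JSON form {"position":N}.
    static final class SimpleOffset {
        final long position;
        SimpleOffset(long position) { this.position = position; }

        String json() {
            return "{\"position\":" + position + "}";
        }
    }

    // Mirrors the deserializeOffset contract: parse the JSON back into
    // the offset type, rejecting input that does not encode a valid offset.
    static SimpleOffset deserializeOffset(String json) {
        String trimmed = json.trim();
        String prefix = "{\"position\":";
        if (!trimmed.startsWith(prefix) || !trimmed.endsWith("}")) {
            throw new IllegalArgumentException("not a valid offset: " + json);
        }
        String number = trimmed.substring(prefix.length(), trimmed.length() - 1);
        return new SimpleOffset(Long.parseLong(number));
    }

    public static void main(String[] args) {
        // Round-trip: serialize an offset, then recover it from the JSON,
        // as the engine does when restarting from a checkpointed offset.
        SimpleOffset end = new SimpleOffset(99L);
        SimpleOffset parsed = deserializeOffset(end.json());
        System.out.println(parsed.position); // prints 99
    }
}
```

The engine persists offsets as JSON in the checkpoint log, so the serialized form must contain everything needed to reconstruct the reader's position after a restart.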