Package org.apache.spark.sql.connector.read.streaming
package org.apache.spark.sql.connector.read.streaming
-
ClassDescriptionIndicates that the source accepts the latest seen offset, which requires streaming execution to provide the latest seen offset when restarting the streaming query from checkpoint./** Represents a
ReadLimitwhere theMicroBatchStreamshould scan approximately given maximum number of rows with at least the given minimum number of rows.A variation onPartitionReaderfor use with continuous streaming processing.A variation onPartitionReaderFactorythat returnsContinuousPartitionReaderinstead ofPartitionReader.ASparkDataStreamfor streaming queries with continuous mode.ASparkDataStreamfor streaming queries with micro-batch mode.An abstract representation of progress through aMicroBatchStreamorContinuousStream.Used for per-partition offsets in continuous processing.Represents aReadLimitwhere theMicroBatchStreammust scan all the data available at the streaming source.Interface representing limits on how much to read from aMicroBatchStreamwhen it implementsSupportsAdmissionControl.Represents aReadLimitwhere theMicroBatchStreamshould scan files which total size doesn't go beyond a given maximum total size.Represents aReadLimitwhere theMicroBatchStreamshould scan approximately the given maximum number of files.Represents aReadLimitwhere theMicroBatchStreamshould scan approximately the given maximum number of rows.Represents aReadLimitwhere theMicroBatchStreamshould scan approximately at least the given minimum number of rows.A mix-in interface for streaming sinks to signal that they can report metrics.A mix-in interface forSparkDataStreamstreaming sources to signal that they can report metrics.The base interface representing a readable data stream in a Spark streaming query.A mix-in interface forSparkDataStreamstreaming sources to signal that they can control the rate of data ingested into the system.An interface for streaming sources that supports running in Trigger.AvailableNow mode, which will process all the available data at the beginning of the query in (possibly) multiple batches.