
[Spark] Spark streaming rate limit#2776

Open
addu390 wants to merge 2 commits into apache:main from addu390:spark-streaming-rate-limit

Conversation

@addu390 addu390 commented Mar 2, 2026

Purpose

Linked issue: close #2550

Add rate limit support for Spark streaming reads to control the number of offsets processed per micro-batch trigger.

Brief change log

  • Added scan.max.offsets.per.trigger, scan.min.offsets.per.trigger, and scan.max.trigger.delay config options in SparkFlussConf
  • Overrode getDefaultReadLimit in FlussMicroBatchStream to return the appropriate ReadLimit for the configured options
  • Note: Offset capping uses proportional fair-share distribution across buckets, i.e. each bucket's cap is proportional to its share of the available offsets. A simpler, more typical approach (maxOffsets / numBuckets) can be used instead if that's preferred.
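The proportional fair-share capping mentioned in the last bullet can be sketched as follows. This is a hypothetical illustration, not code from the PR: the class name `FairShareCap`, the method `capOffsets`, and the use of `String` bucket keys are all assumptions for the sketch.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/**
 * Hypothetical sketch of proportional fair-share offset capping: each bucket
 * receives a share of maxOffsets proportional to how many unread offsets it
 * has, instead of a flat maxOffsets / numBuckets split.
 */
public class FairShareCap {

    static Map<String, Long> capOffsets(Map<String, Long> availablePerBucket, long maxOffsets) {
        long totalAvailable =
                availablePerBucket.values().stream().mapToLong(Long::longValue).sum();
        Map<String, Long> capped = new LinkedHashMap<>();
        if (totalAvailable <= maxOffsets) {
            // Everything fits within the trigger budget; no capping needed.
            capped.putAll(availablePerBucket);
            return capped;
        }
        for (Map.Entry<String, Long> e : availablePerBucket.entrySet()) {
            // Proportional share, rounded down; a real implementation would
            // also redistribute the rounding remainder among buckets.
            long share = maxOffsets * e.getValue() / totalAvailable;
            capped.put(e.getKey(), Math.min(e.getValue(), share));
        }
        return capped;
    }

    public static void main(String[] args) {
        Map<String, Long> available = new LinkedHashMap<>();
        available.put("bucket-0", 600L);
        available.put("bucket-1", 300L);
        available.put("bucket-2", 100L);
        // With maxOffsets = 500 and 1000 offsets available, each bucket is
        // capped at half of what it has.
        System.out.println(capOffsets(available, 500L));
    }
}
```

The advantage over a flat per-bucket split is that skewed buckets are not starved: a bucket with most of the backlog gets most of the budget, while a flat `maxOffsets / numBuckets` split would leave part of the budget unused on near-empty buckets.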

Tests

  • SparkStreamingTest#read: log table with maxOffsetsPerTrigger rate limit

API and Format

New user-facing config options for Spark DataFrameReader:

  • scan.max.offsets.per.trigger
  • scan.min.offsets.per.trigger
  • scan.max.trigger.delay
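As a configuration sketch, a streaming read could set these options roughly like this. Only the three option keys come from this PR; the format name `"fluss"`, the option values, and the surrounding `spark.readStream()` call shape are assumptions for illustration.

```java
// Hypothetical usage sketch (not from the PR diff):
Dataset<Row> stream = spark.readStream()
        .format("fluss")
        .option("scan.max.offsets.per.trigger", "10000") // cap offsets per micro-batch
        .option("scan.min.offsets.per.trigger", "1000")  // wait for at least this many offsets...
        .option("scan.max.trigger.delay", "5m")          // ...but fire anyway after this delay
        .load();
```

The min/delay pair mirrors the semantics of Spark's admission-control read limits: a batch is held back until the minimum is available, unless the maximum trigger delay elapses first.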

Documentation

N/A; the documentation update will be tracked separately.

addu390 changed the title from "Spark streaming rate limit" to "[Spark] Spark streaming rate limit" on Mar 2, 2026


Development

Successfully merging this pull request may close these issues.

[spark] Add rate limit for streaming read
