feat(streams): Add backpressure metrics for consumer strategies by fpacifici · Pull Request #288 · getsentry/streams

fpacifici · 2026-03-30T21:40:47Z

Rust arroyo does not record metrics for backpressure unless the exception is propagated back to the consumer.
This PR adds metrics that record backpressure events wherever they happen, together with a metric that tracks how long a step exerts backpressure.

This is the first step to improve the throughput metrics.

Refactor how metrics are initialized. The existing initializer only covers arroyo metrics, so nothing was being recorded from the Rust code. Unit tests are added as well.
Add a BackpressureTracker that keeps track of the backpressure state of every step and wraps an Arroyo strategy to intercept backpressure events.
Allow the Python code to pass the name of a step to the Rust code (used to populate tags).
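To make the tracker idea above concrete, here is a minimal sketch of per-step episode tracking. It is not the code in this PR: the field names, the `on_rejected`/`on_success` methods, and the choice to pass the clock in as an argument are all assumptions made for illustration.

```rust
use std::time::{Duration, Instant};

/// Hypothetical sketch: tracks one backpressure episode for a single step.
/// Only the struct name comes from the PR; everything else is assumed.
#[derive(Debug, Default)]
pub struct BackpressureTracker {
    /// Set while the step is exerting backpressure.
    started_at: Option<Instant>,
    /// Number of rejections seen so far (would feed a counter metric).
    pub rejections: u64,
}

impl BackpressureTracker {
    /// Record a rejection; the episode timer starts only on the first one.
    pub fn on_rejected(&mut self, now: Instant) {
        self.rejections += 1;
        if self.started_at.is_none() {
            self.started_at = Some(now);
        }
    }

    /// Close the open episode, if any, and return how long it lasted.
    /// The caller would report this value to a duration histogram.
    pub fn on_success(&mut self, now: Instant) -> Option<Duration> {
        self.started_at
            .take()
            .map(|start| now.saturating_duration_since(start))
    }

    /// True while an episode is open.
    pub fn is_backpressured(&self) -> bool {
        self.started_at.is_some()
    }
}
```

Passing `now` explicitly keeps the sketch deterministic under test; a real implementation could just as well call `Instant::now()` internally.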

What would come next:

Buffered metrics, to reduce the performance impact
A success rate, to compute the ratio between successes and backpressure events
More metrics recorders (for example, a log-based one that records stats periodically)

Made with Cursor

Instrument ProcessingStrategy steps with counters and episode histograms for MessageRejected: send_backpressure vs receive_backpressure and matching duration series, using a BackpressureNext wrapper and explicit tracking for StreamSink produce and PythonAdapter. Each metric is labeled by step (operator kind and route). Incomplete episodes at shutdown are not emitted.

Co-Authored-By: Cursor <cursoragent@cursor.com>
Made-with: Cursor
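The wrapper approach described in this commit can be sketched as follows. This is an illustration under assumptions, not the PR's implementation: the `Strategy` trait, the `MessageRejected` unit struct, the `submit_at` and `close` methods, and the `RejectFirst` test double are hypothetical stand-ins for the Arroyo types, and a `Vec<Duration>` stands in for a histogram.

```rust
use std::time::{Duration, Instant};

/// Hypothetical stand-in for Arroyo's rejection error, not the real type.
#[derive(Debug, PartialEq)]
pub struct MessageRejected;

/// Hypothetical, minimal stand-in for a processing strategy.
pub trait Strategy {
    fn submit(&mut self, message: u64) -> Result<(), MessageRejected>;
}

/// Wraps the next step and records backpressure received from it.
pub struct BackpressureNext<S: Strategy> {
    inner: S,
    episode_start: Option<Instant>,
    /// Rejections seen so far (would feed a counter).
    pub rejected_count: u64,
    /// Completed episode durations (would feed a histogram).
    pub durations: Vec<Duration>,
}

impl<S: Strategy> BackpressureNext<S> {
    pub fn new(inner: S) -> Self {
        Self { inner, episode_start: None, rejected_count: 0, durations: Vec::new() }
    }

    /// Forward a message; an episode runs from the first rejection
    /// to the next successful submit.
    pub fn submit_at(&mut self, message: u64, now: Instant) -> Result<(), MessageRejected> {
        match self.inner.submit(message) {
            Ok(()) => {
                if let Some(start) = self.episode_start.take() {
                    self.durations.push(now.saturating_duration_since(start));
                }
                Ok(())
            }
            Err(rejected) => {
                self.rejected_count += 1;
                if self.episode_start.is_none() {
                    self.episode_start = Some(now);
                }
                Err(rejected)
            }
        }
    }

    /// Shutdown: an episode that is still open is dropped, not emitted.
    pub fn close(self) -> Vec<Duration> {
        self.durations
    }
}

/// Test double that rejects the first `remaining` submits.
pub struct RejectFirst {
    pub remaining: u32,
}

impl Strategy for RejectFirst {
    fn submit(&mut self, _message: u64) -> Result<(), MessageRejected> {
        if self.remaining > 0 {
            self.remaining -= 1;
            Err(MessageRejected)
        } else {
            Ok(())
        }
    }
}
```

In this sketch two rejections followed by a success produce one duration sample, while a wrapper closed mid-episode produces none, which mirrors the "incomplete episodes at shutdown are not emitted" rule.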

github-actions · 2026-03-30T21:41:02Z

Semver Impact of This PR

🟡 Minor (new features)

📋 Changelog Preview

This is how your changes will appear in the changelog.
Entries from this PR are highlighted with a left border (blockquote style).

New Features ✨

(streams) Add backpressure metrics for consumer strategies by fpacifici in #288

Internal Changes 🔧

Deps

Bump pygments from 2.19.2 to 2.20.0 in /sentry_streams by dependabot in #286
Bump requests from 2.32.4 to 2.33.0 in /sentry_streams by dependabot in #287

🤖 This preview updates automatically when you update the PR.

sentry_streams/src/operators.rs

sentry_streams/src/backpressure_metrics.rs

Use DSL Step.name for the step label: add_step(step, step_name) on the Rust consumer, parallel step_names in build_chain, and wire names from rust_arroyo (including segment_label for chained maps). Rename EpisodeTracker to BackpressureTracker. Document BackpressureNext::poll vs submit for MessageRejected.

Co-Authored-By: Cursor <cursoragent@cursor.com>
Made-with: Cursor

Move pipeline step labels into RuntimeOperator so consumer assembly no longer tracks a parallel step_names list. This lets Arroyo segment finalization pass a single first-step label through operator construction for backpressure metrics.

Co-Authored-By: GPT-5 Codex <noreply@openai.com>
Made-with: Cursor

Route backpressure counters and duration histograms through STREAMS_RECORDER and shared streams.pipeline helpers. Restore DogStatsD global labels on the exporter. Add unit tests using metrics with_local_recorder plus DebuggingRecorder, a merged StreamsMetricsRecorder test, and BackpressureNext wired to FakeStrategy for MessageRejected.

Co-Authored-By: GPT-5 Codex <noreply@openai.com>
Made-with: Cursor

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.

cursor · 2026-03-31T10:59:41Z

sentry_streams/src/sinks.rs

-                Ok(_) => {}
+                Ok(_) => {
+                    recv_on_success(label, &mut self.produce_recv_tracker);
+                }


Missing send_on_success in poll inflates backpressure duration

Medium Severity

When StreamSink::poll successfully retries the carried-over message, recv_on_success is called for produce_recv_tracker but send_on_success is never called for send_tracker. The send_tracker timer started when a prior submit returned MessageRejected (due to message_carried_over being Some). Once poll clears the carried-over message, the step can accept messages again, but the timer keeps running until the next successful submit call. This inflates send_backpressure_duration by the time between the poll clearing the backlog and the next incoming message, which can be significant in low-throughput scenarios. Adding send_on_success(label, &mut self.send_tracker) alongside recv_on_success in the Ok(_) branch of poll would fix the measurement.
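The effect Bugbot describes can be reproduced with a simplified model. The sketch below is an assumption-laden illustration, not the code in sinks.rs: `Episode`, `Sink`, the `end_send_in_poll` toggle, and `measured_episode` are hypothetical, and the retry in `poll` is assumed to always succeed.

```rust
use std::time::{Duration, Instant};

/// Hypothetical episode timer standing in for the PR's tracker.
#[derive(Default)]
pub struct Episode {
    start: Option<Instant>,
    pub durations: Vec<Duration>,
}

impl Episode {
    /// Start timing unless an episode is already open.
    pub fn begin(&mut self, now: Instant) {
        if self.start.is_none() {
            self.start = Some(now);
        }
    }

    /// Close the open episode, if any, and store its duration.
    pub fn end(&mut self, now: Instant) {
        if let Some(start) = self.start.take() {
            self.durations.push(now.saturating_duration_since(start));
        }
    }
}

/// Simplified sink that holds at most one carried-over message.
#[derive(Default)]
pub struct Sink {
    pub carried_over: Option<u64>,
    pub send_tracker: Episode,
    /// Toggle used only to compare behaviour with and without the fix.
    pub end_send_in_poll: bool,
}

impl Sink {
    /// Rejects while a message is carried over, opening a send episode.
    pub fn submit(&mut self, msg: u64, now: Instant) -> Result<(), u64> {
        if self.carried_over.is_some() {
            self.send_tracker.begin(now);
            return Err(msg);
        }
        self.send_tracker.end(now);
        Ok(())
    }

    /// Retries the carried-over message; the retry always succeeds here.
    pub fn poll(&mut self, now: Instant) {
        if self.carried_over.take().is_some() && self.end_send_in_poll {
            // Suggested fix: the step can accept messages again, so stop the timer.
            self.send_tracker.end(now);
        }
    }
}

/// Scenario: reject at t0, poll clears the backlog at +10 ms,
/// and the next message only arrives at +500 ms.
pub fn measured_episode(end_send_in_poll: bool) -> Duration {
    let t0 = Instant::now();
    let mut sink = Sink { carried_over: Some(1), end_send_in_poll, ..Default::default() };
    assert!(sink.submit(2, t0).is_err());
    sink.poll(t0 + Duration::from_millis(10));
    assert!(sink.submit(2, t0 + Duration::from_millis(500)).is_ok());
    sink.send_tracker.durations[0]
}
```

In this model `measured_episode(false)` reports 500 ms even though the sink was only blocked for 10 ms, while `measured_episode(true)` reports 10 ms. The gap grows with the idle time before the next message, which is why low-throughput pipelines would be most affected.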

Additional Locations (1)

sentry_streams/src/sinks.rs#L265-L266

evanh · 2026-03-31T14:24:54Z

sentry_streams/src/backpressure_metrics.rs

+}
+
+/// End a receive backpressure event after downstream accepted a submit.
+pub fn recv_on_success(step: &str, tracker: &mut BackpressureTracker) {


Why is this a separate function? Why not call the tracker directly in the BackpressureNext class?


fpacifici and others added 2 commits March 31, 2026 00:30

fpacifici force-pushed the fpacifici/add_backpressure branch from 7190cbe to 8f9c8ef Compare March 31, 2026 10:05

fpacifici marked this pull request as ready for review March 31, 2026 10:50

fpacifici requested a review from a team as a code owner March 31, 2026 10:50

fpacifici wants to merge 4 commits into main from fpacifici/add_backpressure
