feat(taskbroker): Add Sending Status to Handle Push Failures by george-sentry · Pull Request #586 · getsentry/taskbroker

george-sentry · 2026-04-02T22:03:01Z

Linear

Description

Currently, taskworkers pull tasks from taskbrokers via RPC. This approach works, but has some drawbacks. Therefore, we want taskbrokers to push tasks to taskworkers instead. Read this page on Notion for more information.

Right now, I rely on processing_deadline to revert processing tasks back to pending if pushing them failed. This isn't good because it eats through processing attempts, resulting in needlessly dropped tasks.

I want to add a Sending status that indicates a task is being sent. Now, upkeep increments processing attempts only for tasks that are still in "sending" when their processing deadlines expire. If the status is "processing," that means the task was already sent successfully and its processing attempts can be incremented.

This will help us avoid dropping tasks needlessly when workers are busy.

Note that my original plan was different. You can see it in the commit history. Here is a description of that plan.

I want to add a sent column to the activation table to track whether a task was successfully sent after being fetched from the table. Now, upkeep increments processing attempts only for tasks that are processing and have sent = true.

If the status is processing and sent = false, that means pushing failed or timed out (or didn't happen yet), and we can revert back to pending without incrementing processing attempts.

linear-code · 2026-04-02T22:03:04Z

STREAM-860 Add Sent Flag to Handle Push Failures

src/upkeep.rs

src/push/mod.rs

benches/store_bench.rs

src/grpc/server.rs

pg_migrations/0001_create_inflight_activations.sql

src/store/inflight_activation.rs

src/upkeep.rs

src/store/inflight_activation.rs

src/upkeep.rs

src/store/inflight_activation.rs

evanh · 2026-04-08T19:49:32Z

src/grpc/server.rs

-
-            Ok(activations) => {
-                let inflight = &activations[0];
+            // If we return an error, the worker will place the result back in its internal queue and send the update again in the future, which is not desired


We should still catch these cases and at least log an error.

Added logging here!

…vations

src/upkeep.rs

src/store/inflight_activation.rs

src/upkeep.rs

…tivations Marked Sent

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 454e0c7. Configure here.}

src/store/postgres_activation_store.rs

sentry · 2026-04-09T21:04:00Z

src/store/postgres_activation_store.rs

        .bind(InflightActivationStatus::Failure.to_string())
        .bind(now)
        .bind(InflightActivationStatus::Processing.to_string())
+        .bind(InflightActivationStatus::Sending.to_string())


Bug: At-most-once (AMO) tasks in Sending status are moved to Failure when their processing deadline expires, even though they were never delivered to a worker.
_{Severity: MEDIUM}

Suggested Fix

Modify the logic for handling expired AMO tasks in the Sending status. Instead of transitioning them to Failure, revert them to Pending status without incrementing the processing attempt counter. This aligns their behavior with non-AMO tasks in the same state and allows for a retry without violating at-most-once guarantees.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid. Location: src/store/postgres_activation_store.rs#L555-L558 Potential issue: When an at-most-once (AMO) task is claimed for processing, it enters a `Sending` status. If the task's delivery to a worker is not confirmed before the processing deadline expires, the current logic moves the task directly to a `Failure` state. This wastes the single execution attempt for a task that was never actually run by any worker. This contradicts the goal of preventing unnecessary task drops, as the task could be safely retried without violating AMO semantics. The expected behavior for an unsent AMO task would be to revert it to `Pending` status, allowing another delivery attempt.

This is intentional. It is possible to deliver a task to a worker successfully but observe a send failure due to downstream network issues. For example, the response may not reach the taskbroker before the request times out if the network is congested. If I used the AI's suggestion in this scenario, the task would be mistakenly retried, resulting in more than one execution (a clear violation of the at-most-once policy).

Add Sent Flag to Prevent Dropping Tasks on Push Failure

358edc1

george-sentry requested a review from a team as a code owner April 2, 2026 22:03

sentry bot reviewed Apr 2, 2026

View reviewed changes

src/upkeep.rs Show resolved Hide resolved

src/push/mod.rs Show resolved Hide resolved

cursor bot reviewed Apr 2, 2026

View reviewed changes

benches/store_bench.rs Outdated Show resolved Hide resolved

Add Metrics for Processing Deadline Resets, Fix AI Reviewer Bugs

7084a24

sentry bot reviewed Apr 3, 2026

View reviewed changes

src/grpc/server.rs Show resolved Hide resolved

cursor bot reviewed Apr 3, 2026

View reviewed changes

pg_migrations/0001_create_inflight_activations.sql Outdated Show resolved Hide resolved

Split Postgres Changes into Migrations

1d248a1

sentry bot reviewed Apr 3, 2026

View reviewed changes

src/store/inflight_activation.rs Outdated Show resolved Hide resolved

cursor bot reviewed Apr 3, 2026

View reviewed changes

src/store/inflight_activation.rs Outdated Show resolved Hide resolved

Handle Claim One Invariant Gracefully

688dc04

sentry bot reviewed Apr 3, 2026

View reviewed changes

src/store/inflight_activation.rs Show resolved Hide resolved

src/upkeep.rs Show resolved Hide resolved

Replace Sent Flag w/Sending Status

56d2efb

george-sentry changed the title ~~feat(taskbroker): Add Sent Flag to Prevent Dropping Tasks on Push Failure~~ feat(taskbroker): Add Sending Status to Handle Push Failures Apr 7, 2026

sentry bot reviewed Apr 7, 2026

View reviewed changes

src/store/inflight_activation.rs Show resolved Hide resolved

src/upkeep.rs Show resolved Hide resolved

cursor bot reviewed Apr 7, 2026

View reviewed changes

src/store/inflight_activation.rs Show resolved Hide resolved

evanh reviewed Apr 8, 2026

View reviewed changes

Mark Demoted Namespace Tasks as Sending, Log Error on No Pending Acti…

a819ea5

…vations

sentry bot reviewed Apr 9, 2026

View reviewed changes

src/upkeep.rs Show resolved Hide resolved

cursor bot reviewed Apr 9, 2026

View reviewed changes

src/upkeep.rs Show resolved Hide resolved

src/store/inflight_activation.rs Show resolved Hide resolved

Emit Metrics for Sending Tasks

991c4ee

cursor bot reviewed Apr 9, 2026

View reviewed changes

src/upkeep.rs Show resolved Hide resolved

Add Sending Count to UpkeepResults Empty Calculation, Warn on No Ac…

454e0c7

…tivations Marked Sent

cursor bot reviewed Apr 9, 2026

View reviewed changes

src/store/postgres_activation_store.rs Show resolved Hide resolved

george-sentry added 2 commits April 9, 2026 13:51

Add Rows Affected Check to PSQL Mark Sent, Fix Unit Tests

646242f

Merge branch 'main' into george/push-taskbroker/add-sent-flag

fa4012b

sentry bot reviewed Apr 9, 2026

View reviewed changes

george-sentry requested a review from evanh April 9, 2026 21:10

Uh oh!

Conversation

george-sentry commented Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Linear

Description

Uh oh!

linear-code bot commented Apr 2, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

evanh Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

george-sentry Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sentry bot Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

george-sentry Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

george-sentry commented Apr 2, 2026 •

edited

Loading