feat(taskbroker): Batch Status Updates #618
Conversation
Since we may want to treat claimed → processing updates the same way, I'm actually going to create a more general …
```rust
/// Run flusher that receives values of type T from a channel and flushes
/// them using the provided async `flush` function either when the batch is
/// full or when the max flush interval has elapsed.
pub async fn run_flusher<T, F>(
```
I created this function because I'm also planning to batch claimed → processing updates in the push pool, which will use basically identical machinery.
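For illustration, here is a minimal, self-contained sketch of what such a generic flusher could look like. The PR's actual signature is `run_flusher<T, F>` and its error handling is not shown in this excerpt, so the extra `Fut` parameter, the callback shape, and every name other than `run_flusher` are assumptions:

```rust
use std::future::Future;
use std::time::Duration;
use tokio::sync::mpsc::Receiver;
use tokio::time::interval;

/// Sketch of a generic flusher: buffer items of any type `T` and hand them to
/// the async `flush` callback when the batch is full or the interval elapses.
pub async fn run_flusher<T, F, Fut>(
    mut rx: Receiver<T>,
    batch_size: usize,
    flush_interval: Duration,
    flush: F,
) where
    F: Fn(Vec<T>) -> Fut,
    Fut: Future<Output = ()>,
{
    let mut buffer: Vec<T> = Vec::with_capacity(batch_size);
    let mut ticker = interval(flush_interval);

    loop {
        tokio::select! {
            maybe_item = rx.recv() => match maybe_item {
                Some(item) => {
                    buffer.push(item);
                    if buffer.len() >= batch_size {
                        // Full batch: drain the buffer and flush it.
                        flush(std::mem::take(&mut buffer)).await;
                    }
                }
                None => {
                    // All senders dropped: flush the remainder and stop.
                    if !buffer.is_empty() {
                        flush(std::mem::take(&mut buffer)).await;
                    }
                    return;
                }
            },
            // Max flush interval elapsed: flush whatever has accumulated.
            _ = ticker.tick() => {
                if !buffer.is_empty() {
                    flush(std::mem::take(&mut buffer)).await;
                }
            }
        }
    }
}
```

With this shape, batching the claimed → processing updates in the push pool would only need a different item type `T` and a different `flush` callback.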
```rust
_ = interval.tick() => {
```
This code now lives in `flusher.rs`, but it contains a similar loop. This condition triggers every `interval_ms` after the previous tick.
The flusher only handles the tick when `select!` actually chooses this branch. If messages keep arriving and the `rx.recv()` arm keeps winning before the tick is ready, the tick still advances in the background. When the tick is ready and this arm is selected, the buffer is flushed.
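As a standalone illustration of that point (a toy program, not code from the PR), the other branch below wins most `select!` iterations, yet the interval keeps advancing and the tick branch still runs about once per period when it is selected:

```rust
use std::time::Duration;
use tokio::time::{interval, sleep};

#[tokio::main]
async fn main() {
    let mut ticker = interval(Duration::from_millis(100));
    ticker.tick().await; // the first tick completes immediately
    let mut other_wins = 0u32;
    loop {
        tokio::select! {
            // Stands in for the rx.recv() arm winning over and over.
            _ = sleep(Duration::from_millis(10)) => {
                other_wins += 1;
            }
            // Still fires about every 100 ms, even though the other arm
            // was selected many times in between.
            _ = ticker.tick() => {
                println!("tick; other branch has won {other_wins} times so far");
                if other_wins >= 30 {
                    break;
                }
            }
        }
    }
}
```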
```rust
for id in ids {
    buffer.push((id, status));
}
```
Let's say there is a DB issue, would we keep appending to the buffer indefinitely? I think we should add a limit after which we stop and retry on the DB.
This was actually dead code, but similar logic now lives elsewhere.
No, we only append to the buffer while it hasn't reached the desired batch size. So if there's a DB issue, here's what should happen.
- Timer runs out or buffer fills up → call `flush` (this function)
- As long as `flush` is running, the (now empty) buffer does not receive any more IDs
- Flush fails because the store is unresponsive or some other problem
- IDs are pushed back onto the buffer (which was emptied right before attempting the flush)
- Flush function exits

So if the DB has a problem, we will keep retrying the same batch of IDs over and over again until it succeeds (see the sketch below).
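A small sketch of that failure path (the `StatusUpdate` alias is simplified and `write_batch_to_store` is a hypothetical stand-in for the real store call):

```rust
// Simplified stand-ins for the real types; illustrative only.
type Status = &'static str;
type StatusUpdate = (String, Status);

// The buffer is emptied right before the store call; on failure the same
// batch is pushed back, so the next tick or full-buffer trigger retries it.
async fn flush(buffer: &mut Vec<StatusUpdate>) {
    let batch = std::mem::take(buffer); // buffer is empty while flush runs
    match write_batch_to_store(&batch).await {
        Ok(()) => {} // persisted; nothing to put back
        Err(err) => {
            // Store unresponsive or some other problem: push the IDs back so
            // the same batch keeps being retried until it succeeds.
            eprintln!("flush failed, will retry: {err}");
            buffer.extend(batch);
        }
    }
}

// Hypothetical placeholder for the real store write.
async fn write_batch_to_store(_batch: &[StatusUpdate]) -> Result<(), String> {
    Err("store unavailable".to_string())
}
```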
```rust
pub type StatusUpdate = (String, InflightActivationStatus);

pub async fn flush_status_updates(
```
I don't think this function should be in the server file. This feels like maybe it should be part of the store itself.
Sure. In its own file within the store module, or within an existing file?
```rust
Some(v) => {
    buffer.push(v);

    while let Ok(update) = rx.try_recv() {
```
Why do this again here? Won't this update get processed on the next loop?
Yes, but in theory, it'll be slower if we wait for the next loop iteration, since it'll need to be reawakened. The idea is, while we're already awake, we might as well empty the whole channel rather than going back to sleep just to be awakened a moment later for the next item in the channel.
However, I think there was a bug here that could result in this loop spinning for a very long time. I fixed it by adding a `while buffer.len() < batch_size` condition.
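Roughly, the fixed drain step could look like this (the helper name and exact structure are illustrative):

```rust
use tokio::sync::mpsc::Receiver;

// After recv() wakes the task, keep pulling already-queued items with
// try_recv(), but stop once the buffer reaches batch_size so a fast producer
// cannot keep this loop spinning indefinitely.
fn drain_pending<T>(rx: &mut Receiver<T>, buffer: &mut Vec<T>, batch_size: usize) {
    while buffer.len() < batch_size {
        match rx.try_recv() {
            Ok(update) => buffer.push(update),
            Err(_) => break, // channel empty (or closed): go back to select!
        }
    }
}
```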
```rust
let handle = tokio::spawn(async move {
    flusher::run_flusher(
        rx,
        flusher_config.status_flush_batch_size,
```
I'm not sure this is right. Won't this create another listener on the channel? Does that mean we will get duplicates?
When we are shutting down, presumably we won't get a full batch. That means we are waiting at least `status_flush_interval_ms` before the application can shut down safely. I would be inclined to have this value be much lower when we are shutting down, to try and clear the batch as quickly as possible.
No, `rx` here is the only listener on this channel, as it's an MPSC (multi-producer, single-consumer) channel. And in this case there is also only one writer: the gRPC server.
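For illustration, a self-contained version of that wiring (all names are hypothetical): the spawned flusher task takes ownership of the single `rx`, and the `tx` side is what the gRPC handler would hold, so no update can be delivered twice.

```rust
use tokio::sync::mpsc;

#[tokio::main]
async fn main() {
    let (tx, mut rx) = mpsc::channel::<(String, &'static str)>(1024);

    // The flusher task owns the sole receiver; run_flusher would buffer and
    // batch these updates, printing stands in for that here.
    let flusher = tokio::spawn(async move {
        while let Some((id, status)) = rx.recv().await {
            println!("received update: {id} -> {status}");
        }
    });

    // The gRPC server would hold (clones of) `tx`; it is the only writer.
    tx.send(("task-1".to_string(), "complete")).await.unwrap();
    drop(tx); // dropping all senders lets the receiver loop finish

    flusher.await.unwrap();
}
```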

Linear
Completes STREAM-918
Description
On the usual workload of 100-millisecond tasks, with the new "claimed" status, we can do around 5K tasks per second in the sandbox. Batching status updates reduces DB load, so all queries take less time; this can increase throughput by 1K to 2K tasks per second.