revert: multiprocess kind processing #746

bhearsum · 2025-08-14T17:27:47Z

While this worked very well on Linux, it causes issues anywhere we can't fork to get a new process (most notably on Windows). The problem lies in the fact that in these cases, we spawn an entire new process, which re-imports taskgraph from scratch. This is fine in some cases, but in any case where global state has been modified in an earlier part of TaskGraphGenerator._run, we lose whatever side effects happened there, and end up failing in some way.

Concretely: in gecko we add a bunch of payload_builders as part of registering the graph config. This code doesn't re-run in the spawned processes, so the payload builders don't exist there.

There are workarounds for this: for example, redoing all the earlier work of _run in each subprocess, or perhaps finding some way to ensure all the needed state is passed explicitly. There's no quick and easy way to make this work though, and some thought should be given to the tradeoffs of doing it (vs. doing nothing, or spending the effort on a different way to parallelize) before proceeding.

While this worked very well on Linux, it causes issues anywhere we can't `fork` to get a new process (most notably on Windows). The problem lies in the fact that in these cases, we spawn an entire new process, which re-imports taskgraph from scratch. This is fine in some cases, but in any case where global state has been modified in an earlier part of `TaskGraphGenerator._run`, we lose whatever side effects happened there, and end up failing in some way. Concretely: in gecko we add a bunch of `payload_builders` as part of registering the graph config. This code doesn't re-run in the spawned processes, so the payload builders don't exist there. There are workarounds for this: for example, redoing all the earlier work of `_run` in each subprocess, or perhaps finding some way to ensure all the needed state is passed explicitly. There's no quick and easy way to make this work though, and some thought should be given to the tradeoffs of doing it (vs. doing nothing, or spending the effort on a different way to parallelize) before proceeding.

bhearsum · 2025-08-14T17:28:10Z

This backs out #738 and #744.

Eijebong

:(

bhearsum requested a review from a team as a code owner August 14, 2025 17:27

bhearsum requested a review from hneiva August 14, 2025 17:27

Eijebong approved these changes Aug 14, 2025

View reviewed changes

bhearsum merged commit 0063027 into taskcluster:main Aug 14, 2025
15 checks passed

bhearsum mentioned this pull request Aug 14, 2025

Generate kinds concurrently or in parallel #5

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

revert: multiprocess kind processing #746

revert: multiprocess kind processing #746

Uh oh!

bhearsum commented Aug 14, 2025

Uh oh!

bhearsum commented Aug 14, 2025

Uh oh!

Eijebong left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

revert: multiprocess kind processing #746

revert: multiprocess kind processing #746

Uh oh!

Conversation

bhearsum commented Aug 14, 2025

Uh oh!

bhearsum commented Aug 14, 2025

Uh oh!

Eijebong left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants