feat(benchmarks): add BAL benchmarks for the optimization strategies introduced by BALs by jochem-brouwer · Pull Request #2197 · ethereum/execution-specs

jochem-brouwer · 2026-02-12T00:42:48Z

🗒️ Description

This PR adds BAL tests, for more information and background see jochem-brouwer#1

Huge thanks to @fselmo for refactoring and adding tests!

🔗 Related Issues or PRs

N/A.

✅ Checklist

All: Ran fast tox checks to avoid unnecessary CI fails, see also Code Standards and Enabling Pre-commit Checks:
```
uvx tox -e static
```
All: PR title adheres to the repo standard - it will be used as the squash commit message and should start type(scope):.
All: Considered updating the online docs in the ./docs/ directory.
All: Set appropriate labels for the changes (only maintainers can apply labels).
Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.
Tests: For PRs implementing a missed test case, update the post-mortem document to add an entry the list.
Ported Tests: All converted JSON/YML tests from ethereum/tests or tests/static have been assigned @ported_from marker.

Cute Animal Picture

- test_prefetch_cold_storage: Cold SLOAD workload with sequential and hash-chain scattered access patterns. The scattered pattern is unpredictable without BAL but trivially prefetchable with one. - test_coinbase_serialization: Disjoint contracts where coinbase fee accumulation is the only shared state. Implicit (fees) and explicit (CALL to coinbase) variants. - test_deploy_then_interact: Deploy/call tx pairs in a single block. Independent pairs (parallelizable) and single-contract (serial) variants. - test_mixed_dependency_graph: Interleaved groups of serial keccak chains. Group sizes 1/2/5 control available parallelism.

LouisTsai-Csie · 2026-02-24T07:48:57Z

Hi @jochem-brouwer, based on your BAL benchmark description, I put together a small refactor below. I’ve run it locally and it passes.

@pytest.mark.valid_from("Amsterdam")
def test_tx_dependency(
    benchmark_test: BenchmarkTestFiller,
    pre: Alloc,
    fork: Fork,
) -> None:
    """
    Benchmark BAL with transaction-dependent execution.

    Deploy a contract that reads storage slot 0, computes a
    keccak256 hash chain until gas is nearly exhausted, and writes
    the result back. Each transaction depends on the previous
    one's storage write, preventing parallel execution.
    """
    target_slot = 0

    setup = Op.MSTORE(
        0,
        Op.SLOAD(target_slot, key_warm=False),
        # gas accounting
        old_memory_size=0,
        new_memory_size=32,
    )

    loop_body = Op.MSTORE(
        0,
        Op.SHA3(0, 32, data_size=32),
        # gas accounting
        old_memory_size=32,
        new_memory_size=32,
    )

    cleanup = (
        Op.SSTORE(
            target_slot,
            Op.MLOAD(0),
            # gas accounting
            key_warm=True,
            original_value=1,
            current_value=1,
            new_value=2,
        )
    )

    reserve_gas = cleanup.gas_cost(fork) + 50
    condition = Op.GT(Op.GAS, reserve_gas)

    attack_code = setup + While(body=loop_body, condition=condition) + cleanup

    attack_contract = pre.deploy_contract(
        code=attack_code,
        storage={0: 1},
    )

    benchmark_test(
        tx=Transaction(
            to=attack_contract,
            sender=pre.fund_eoa(),
        ),
        skip_gas_used_validation=True,
    )

Each transaction starts by reading slot 0, continues the hash chain, and then stores the final result back to slot 0. I didn’t change the overall design, but I simplified the per-transaction logic since our benchmark test wrapper already generates multiple transactions while respecting tx gas limit cap.

fselmo · 2026-02-24T17:03:03Z

@LouisTsai-Csie can you please help review jochem-brouwer#1 when you get a chance 👀? It builds on the TODOs here and adds more test cases. If it's not far off and only needs tweaks, I think we can at least merge it and work on subsequent updates here so that we have a more complete picture PR'd to ethereum/execution-specs, and since this is still in Draft. Wdyt?

raxhvl · 2026-03-05T13:53:36Z

+
+    condition = Op.GT(Op.GAS, reserve_gas)
+
+    loop = While(body=keccak_body, condition=condition)


issue(perf, non-blocking): Loop wastes 5 gas per cycle

The while loop compiles to:

[setup][JUMPDEST][body][condition][compute jumpdest][JUMPI][cleanup]

and the jumpdest at runtime using an offset:

JUMPDEST ← we want to jump back here body condition PUSH4 <offset> ← "how far back is the JUMPDEST?" PC ← "where am I right now?" SUB ← PC - offset = JUMPDEST address JUMPI

So PC + SUB (= 5 gas) is a workaround ["I don't know where I am in absolute terms, but I know how far back to jump."]

i.e: the root cause is: While is not aware of setup.

Suggestion

Introduce an optional setup code for While generator ( = for loop primitive) so jumpdest can be computed at compile-time JUMPDEST = len(setup):

setup JUMPDEST body condition PUSH1 len(setup) JUMPI

cc: @marioevz @LouisTsai-Csie

This is a good point and it looks like this is part of the definition of While. I think it's possible to generalize this idea and thus to remove the calculate-jumpdest-at-runtime for all while loops. (Should be addressed in a refactor at some point)

Refactor/bal benchmarks

jochem-brouwer · 2026-03-17T00:29:22Z

I have tested against EthJS to verify that the tests fill and added some minor changes. The tests fill, and on a quick inspection it also looks like they execute the expected behavior (loops without OOGs).

I think we can merge this one for BAL-specific benchmarks.

codecov · 2026-03-17T01:08:47Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 86.01%. Comparing base (be3678d) to head (2dfb2fb).
⚠️ Report is 170 commits behind head on forks/amsterdam.

Additional details and impacted files

@@                 Coverage Diff                 @@
##           forks/amsterdam    #2197      +/-   ##
===================================================
- Coverage            86.07%   86.01%   -0.07%     
===================================================
  Files                  599      599              
  Lines                39472    36904    -2568     
  Branches              3780     3771       -9     
===================================================
- Hits                 33977    31744    -2233     
+ Misses                4862     4551     -311     
+ Partials               633      609      -24

Flag	Coverage Δ
unittests	`86.01% <ø> (-0.07%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

LouisTsai-Csie · 2026-03-17T08:26:15Z

+    body_gas = body.gas_cost(fork)
+    placeholder = Op.GT(Op.GAS, Op.PUSH1(0))
+    per_iter_gas = While(body=body, condition=placeholder).gas_cost(fork)
+    exit_overhead = per_iter_gas - body_gas - Op.JUMPDEST.gas_cost(fork)


Just leave a note here, i hope PR #2103 could help when it is merged

fselmo

This lgtm and we should ideally get this in now so going to merge :). @LouisTsai-Csie you mentioned you'd like to refactor here. Can you create an issue to track the refactor? Going to ping you here so you do not forget :)

* ✨ feat(tests): EIP-7928 SELFDESTRUCT tests * feat: point to latest commit in BALs specs (resolver) * feat: Validate t8n BAL does not have duplicate entries for the same tx_index * 📄 docs: Changelog entry * chore: avoid extra fields in BAL classes, related to ethereum#2197 * Add tests for EIP-7928 around precompiles (doc) * fix(tests): Fix expectations for self-destruct tests --------- Co-authored-by: raxhvl <raxhvl@users.noreply.github.com> Co-authored-by: fselmo <fselmo2@gmail.com> Co-authored-by: Toni Wahrstätter <51536394+nerolation@users.noreply.github.com>

feat(tests): add hash chain test to test parallel execution benchs

33f61ec

fselmo self-assigned this Feb 17, 2026

fselmo self-requested a review February 17, 2026 14:40

fselmo added 8 commits February 17, 2026 16:33

refactor: move test file to compute/scenario/

48e31d8

refactor: clean up, address TODOs, parametrize more evenly

3498c0b

feat(test): Add initial state root computation benchmark test for BALs

5ca7859

refactor(test): use gas_cost as appropriately as we can for costs

2ce05cb

refactor(test): clean up, DRY, and refactor

90fbc4d

fix(test): updates from comments

28a68c4

refactor: move to agreed upon path for BAL benchmarks

ac4806c

fselmo mentioned this pull request Feb 24, 2026

feat: Build on TODOs and add more BAL benchmark tests jochem-brouwer/execution-specs#1

Merged

8 tasks

raxhvl reviewed Mar 5, 2026

View reviewed changes

jochem-brouwer added 5 commits March 17, 2026 00:56

fix(benchmarks): add reference dummy

640df71

fix(benchmarks): skip test in cases where gas limit is too low

abab0af

fix(benchmarks): make ruff happy

21f3662

fix(benchmarks): make mypy happy

f97e534

Merge pull request #2 from jochem-brouwer/refactor/bal-benchmarks

2dfb2fb

Refactor/bal benchmarks

jochem-brouwer changed the title ~~feat(tests): add hash chain test to test parallel execution benchs~~ feat(benchmarks): add BAL benchmarks for the optimization strategies introduced by BALs Mar 17, 2026

jochem-brouwer marked this pull request as ready for review March 17, 2026 00:28

raxhvl mentioned this pull request Mar 17, 2026

perf: Improve gas consumption of While loop #2519

Open

LouisTsai-Csie self-requested a review March 17, 2026 06:52

LouisTsai-Csie reviewed Mar 17, 2026

View reviewed changes

fselmo approved these changes Mar 17, 2026

View reviewed changes

fselmo merged commit 905db26 into ethereum:forks/amsterdam Mar 17, 2026
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(benchmarks): add BAL benchmarks for the optimization strategies introduced by BALs#2197

feat(benchmarks): add BAL benchmarks for the optimization strategies introduced by BALs#2197
fselmo merged 14 commits intoethereum:forks/amsterdamfrom
jochem-brouwer:bal-benchmarks

jochem-brouwer commented Feb 12, 2026 •

edited

Loading

Uh oh!

LouisTsai-Csie commented Feb 24, 2026

Uh oh!

fselmo commented Feb 24, 2026 •

edited

Loading

Uh oh!

raxhvl Mar 5, 2026

Uh oh!

jochem-brouwer Mar 17, 2026

Uh oh!

jochem-brouwer commented Mar 17, 2026

Uh oh!

codecov Bot commented Mar 17, 2026

Uh oh!

LouisTsai-Csie Mar 17, 2026

Uh oh!

fselmo left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants


		condition = Op.GT(Op.GAS, reserve_gas)

		loop = While(body=keccak_body, condition=condition)

Conversation

jochem-brouwer commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🗒️ Description

🔗 Related Issues or PRs

✅ Checklist

Cute Animal Picture

Uh oh!

LouisTsai-Csie commented Feb 24, 2026

Uh oh!

fselmo commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

raxhvl Mar 5, 2026

Choose a reason for hiding this comment

issue(perf, non-blocking): Loop wastes 5 gas per cycle

Suggestion

Uh oh!

jochem-brouwer Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

jochem-brouwer commented Mar 17, 2026

Uh oh!

codecov Bot commented Mar 17, 2026

Codecov Report

Uh oh!

LouisTsai-Csie Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

fselmo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

jochem-brouwer commented Feb 12, 2026 •

edited

Loading

fselmo commented Feb 24, 2026 •

edited

Loading