perf: Add microbenchmark for hash expressions #3028

andygrove · 2026-01-02T19:08:05Z

Which issue does this PR close?

Closes #.

Rationale for this change

What changes are included in this PR?

How are these changes tested?

codecov-commenter · 2026-01-02T19:31:21Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 59.55%. Comparing base (f09f8af) to head (cec2982).
⚠️ Report is 812 commits behind head on main.

Additional details and impacted files

@@             Coverage Diff              @@
##               main    #3028      +/-   ##
============================================
+ Coverage     56.12%   59.55%   +3.43%     
- Complexity      976     1379     +403     
============================================
  Files           119      167      +48     
  Lines         11743    15496    +3753     
  Branches       2251     2569     +318     
============================================
+ Hits           6591     9229    +2638     
- Misses         4012     4970     +958     
- Partials       1140     1297     +157

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

mbutrovich

LGTM, minor comment request.

mbutrovich · 2026-01-06T15:47:19Z

spark/src/test/scala/org/apache/spark/sql/benchmark/CometHashExpressionBenchmark.scala

+            dir,
+            spark.sql(s"""
+              SELECT
+                CASE WHEN value % 100 = 0 THEN NULL ELSE CONCAT('string_', CAST(value AS STRING)) END AS c_str,


Same comment as in #3026

Would you add a succinct comment that gives a high level summary of the data distribution you're trying to create? We can certainly read through the CASE WHEN logic, but ... it's not obvious what the underlying values and math is hard.

mbutrovich

Thanks @andygrove!

andygrove added 2 commits January 2, 2026 12:07

add microbenchmark for hash expressions

48cc37b

skip some CI workflows for benchmark changes

9af4184

andygrove added 5 commits January 2, 2026 15:14

skip failing suite

6869f79

Merge branch 'skip-ci-bench' into hash-bench

b1ed4d3

skip more workflows on benchmark PRs

c213912

Merge branch 'skip-more-workflows-on-benchmark-prs' into hash-bench

cec2982

Merge remote-tracking branch 'apache/main' into hash-bench

ece06ef

mbutrovich requested changes Jan 6, 2026

View reviewed changes

address feedback

828a455

mbutrovich approved these changes Jan 6, 2026

View reviewed changes

andygrove merged commit 092d88c into apache:main Jan 6, 2026
1 check passed

andygrove deleted the hash-bench branch January 6, 2026 19:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: Add microbenchmark for hash expressions #3028

perf: Add microbenchmark for hash expressions #3028

andygrove commented Jan 2, 2026

Uh oh!

codecov-commenter commented Jan 2, 2026 •

edited

Loading

Uh oh!

mbutrovich left a comment

Uh oh!

mbutrovich Jan 6, 2026

Uh oh!

mbutrovich left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

perf: Add microbenchmark for hash expressions #3028

perf: Add microbenchmark for hash expressions #3028

Conversation

andygrove commented Jan 2, 2026

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

How are these changes tested?

Uh oh!

codecov-commenter commented Jan 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

mbutrovich left a comment

Choose a reason for hiding this comment

Uh oh!

mbutrovich Jan 6, 2026

Choose a reason for hiding this comment

Uh oh!

mbutrovich left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov-commenter commented Jan 2, 2026 •

edited

Loading