[CALCITE-7409] MERGE JOIN condition cannot contain IS NOT DISTINCT FROM by xiedeyantu · Pull Request #4785 · apache/calcite

xiedeyantu · 2026-02-04T16:27:09Z

mihaibudiu · 2026-02-04T19:36:33Z

core/src/main/java/org/apache/calcite/adapter/enumerable/EnumerableMergeJoinRule.java


  @Override public @Nullable RelNode convert(RelNode rel) {
    Join join = (Join) rel;
+    // MergeJoin cannot handle IS NOT DISTINCT FROM because it stops at NULL values


there is another comment about this right below; are both comments necessary?

I'd like to keep these two: one explaining the reason, and the other describing a reasonable approach. However, supporting "IS NOT DISTINCT FROM" would require modifying Linq4j, which doesn't seem straightforward. So, I'll add an extra TODO for now. WDYT?

You can check for a JIRA issue about it, and if there is one add the link.
I think the two comments could be combined into one.

Is it acceptable to change it to the following format?

// TODO: support IS NOT DISTINCT FROM condition as join keys of MergeJoin. // MergeJoin cannot handle IS NOT DISTINCT FROM because it stops at NULL values // while IS NOT DISTINCT FROM treats NULL = NULL as true.

I couldn't find anything similar on Jira, or maybe I'm just not very good at using Jira.

xiedeyantu · 2026-02-04T22:52:43Z

This SQL query actually comes from CALCITE-6452. Jira should currently be working correctly, but due to a cost model issue, it's selecting MergeJoin. From the DAG, it's clear that MergeJoin has about 7 fewer rows than HashJoin, but consumes 6 times more CPU. The current cost model ignores CPU usage, hence selecting MergeJoin. Although the cost model is flawed, this error should still be fixed.

rel#207 (EnumerableHashJoin):
  rows=14.209999999999999
  cost={222.6832026146136 rows, 204.2 cpu, 0.0 io}

rel#213 (EnumerableMergeJoin):
  rows=14.209999999999999
  cost={215.13639999999998 rows, 1272.3104732091813 cpu, 0.0 io}

sonarqubecloud · 2026-02-05T11:54:36Z

Quality Gate passed

Issues
1 New issue
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

mihaibudiu reviewed Feb 4, 2026

View reviewed changes

mihaibudiu approved these changes Feb 4, 2026

View reviewed changes

xiedeyantu force-pushed the CALCITE-7409 branch from 5e0693e to 8d79878 Compare February 5, 2026 11:34

[CALCITE-7409] MERGE JOIN condition cannot contain IS NOT DISTINCT FROM

d4bc742

xiedeyantu force-pushed the CALCITE-7409 branch from 8d79878 to d4bc742 Compare February 5, 2026 11:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CALCITE-7409] MERGE JOIN condition cannot contain IS NOT DISTINCT FROM#4785

[CALCITE-7409] MERGE JOIN condition cannot contain IS NOT DISTINCT FROM#4785
xiedeyantu wants to merge 1 commit intoapache:mainfrom
xiedeyantu:CALCITE-7409

xiedeyantu commented Feb 4, 2026

Uh oh!

mihaibudiu Feb 4, 2026

Uh oh!

xiedeyantu Feb 4, 2026

Uh oh!

mihaibudiu Feb 4, 2026

Uh oh!

xiedeyantu Feb 4, 2026

Uh oh!

xiedeyantu commented Feb 4, 2026

Uh oh!

sonarqubecloud bot commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

xiedeyantu commented Feb 4, 2026

Uh oh!

mihaibudiu Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

xiedeyantu Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

mihaibudiu Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

xiedeyantu Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

xiedeyantu commented Feb 4, 2026

Uh oh!

sonarqubecloud bot commented Feb 5, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants