Handle extra semicolons after import statements by saranshflip · Pull Request #6323 · openrewrite/rewrite

saranshflip · 2025-11-22T05:59:03Z

What's changed?

Added a debug test to investigate Java parser behavior with consecutive semicolons in import statements. Enhanced the parser to detect and consume extra semicolons following import statements, storing them in the whitespace for accurate source representation.
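To make the "consume and keep in whitespace" idea concrete, here is a minimal, self-contained sketch. It is not the code from this PR or from the OpenRewrite parser: `ExtraSemicolonSketch`, `Scan`, and `consumeExtraSemicolons` are hypothetical names, and the sketch works on a plain `String` plus a cursor index rather than on real parser state.

```java
final class ExtraSemicolonSketch {

    /** Hypothetical holder: the text swallowed after an import and the new cursor position. */
    static final class Scan {
        final String consumed;
        final int cursor;
        final int extraSemicolons;

        Scan(String consumed, int cursor, int extraSemicolons) {
            this.consumed = consumed;
            this.cursor = cursor;
            this.extraSemicolons = extraSemicolons;
        }
    }

    /**
     * Starting right after the import's own terminating ';', scan over any run of
     * whitespace and ';' characters. Everything up to and including the last extra
     * ';' is returned verbatim so it can be stored next to whitespace and printed
     * back unchanged. Whitespace after the last ';' is left for the next element.
     */
    static Scan consumeExtraSemicolons(String source, int cursor) {
        int end = cursor; // end of the last extra ';' seen so far
        int count = 0;
        for (int i = cursor; i < source.length(); i++) {
            char c = source.charAt(i);
            if (c == ';') {
                count++;
                end = i + 1;
            } else if (!Character.isWhitespace(c)) {
                break; // the next real token (or a comment) starts here
            }
        }
        return new Scan(source.substring(cursor, end), end, count);
    }

    public static void main(String[] args) {
        String source = "import java.util.*;;\nimport java.io.*;\n";
        int afterImport = source.indexOf(';') + 1;
        Scan scan = consumeExtraSemicolons(source, afterImport);
        System.out.println("consumed " + scan.extraSemicolons
                + " extra semicolon(s), cursor now at " + scan.cursor);
    }
}
```

Because the consumed text is kept verbatim, concatenating the text before the cursor, `consumed`, and the remainder reproduces the original source, which is the property a lossless tree needs here. Comments between semicolons are not handled in this sketch.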

What's your motivation?

Investigating a parsing issue where consecutive semicolons in import statements (e.g., import java.util.*;;) cause parser failures or produce an incorrect LST containing J.Erroneous nodes. This test helps identify the root cause and validate potential fixes.

Anything in particular you'd like reviewers to focus on?

Current Status: The parser now bypasses the parsing error and generates an LST, but the tree contains J.Erroneous nodes, which indicates the fix is incomplete.

Key concerns:

- The LST structure is not correctly representing the consecutive semicolons
- J.Erroneous nodes suggest the parser is still struggling with this syntax
- Need guidance on proper tree generation for this edge case

Would appreciate feedback on:

- Correct approach to handle extra semicolons in the grammar/parser
- How to properly store consecutive semicolons in the LST without creating erroneous nodes
- Whether the whitespace approach is the right strategy

Anyone you would like to review specifically?

Looking for maintainers familiar with the Java parser implementation and LST tree generation.

Have you considered any alternatives or workarounds?

- Current approach: Storing extra semicolons in whitespace partially works but creates erroneous nodes
- Alternative: Modifying the grammar to explicitly handle consecutive semicolons
- Workaround: Pre-processing source to normalize consecutive semicolons before parsing
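For reference, the pre-processing workaround could look roughly like the sketch below. This is an assumption about what such a step might be, not something that exists in this PR or in OpenRewrite: `ImportSemicolonNormalizer` and `normalize` are hypothetical names, and the regex works purely on text.

```java
import java.util.regex.Pattern;

final class ImportSemicolonNormalizer {

    // An import at the start of a line, its terminating ';', then one or more
    // extra ';' (optionally separated by spaces or tabs) on the same line.
    private static final Pattern EXTRA_SEMICOLONS =
            Pattern.compile("(?m)^([ \\t]*import\\s+[^;\\n]+;)(?:[ \\t]*;)+");

    /** Drops the extra semicolons and keeps the import with its single ';'. */
    static String normalize(String source) {
        return EXTRA_SEMICOLONS.matcher(source).replaceAll("$1");
    }

    public static void main(String[] args) {
        System.out.print(normalize("import java.util.*;;\nimport java.io.*;\n"));
    }
}
```

The trade-off is that this changes the text before it reaches the parser, so the printed result can no longer match the original file character for character. Being text-level, it would also rewrite a matching line inside a block comment or text block.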

Any additional context

This is a work-in-progress contribution. The debug test successfully identifies the issue and the current fix prevents parse failures, but the LST generation needs refinement. The test includes comprehensive debugging output to help understand the parser's behavior with malformed import statements.

Test output shows:

- Parser no longer fails completely
- LST tree is generated but contains erroneous nodes
- Source round-trip may not be perfect due to incorrect tree structure

Seeking guidance on the proper implementation to generate a clean LST without J.Erroneous nodes.

Enhanced the parser to detect and consume extra semicolons following import statements, storing them in the whitespace for accurate source representation. Added a debug test to verify import parsing and whitespace handling.

Refine comment for clarity in parsing success case.

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

timtebeek · 2026-01-07T13:54:40Z

            case ASSIGNMENT:
            case DO_WHILE_LOOP:
-            case IMPORT:
+            case IMPORT:{


Given the fall-through from above, this should probably become its own block below THROW.
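A small sketch of why the placement matters, under the assumption that the surrounding switch groups several case labels over one shared body. The `Kind` enum and both methods are hypothetical stand-ins; the real switch has more labels and different bodies.

```java
final class FallThroughSketch {

    // Hypothetical stand-in for the tree kinds named in the diff.
    enum Kind { ASSIGNMENT, DO_WHILE_LOOP, IMPORT, THROW, OTHER }

    // Shape of the change as proposed: IMPORT gets a braced body while the
    // labels above it still fall through into that body.
    static String bracedInPlace(Kind kind) {
        switch (kind) {
            case ASSIGNMENT:
            case DO_WHILE_LOOP:
            case IMPORT: {
                return "import-specific handling";
            }
            case THROW:
                return "shared handling";
            default:
                return "other";
        }
    }

    // Shape suggested in review: the shared group stays together and IMPORT
    // gets its own block after it, so nothing else falls into it.
    static String ownBlock(Kind kind) {
        switch (kind) {
            case ASSIGNMENT:
            case DO_WHILE_LOOP:
            case THROW:
                return "shared handling";
            case IMPORT: {
                return "import-specific handling";
            }
            default:
                return "other";
        }
    }
}
```

In the first shape an ASSIGNMENT also runs the import-specific code; in the second only IMPORT does.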

timtebeek · 2026-01-07T13:55:24Z

+                    System.err.println("IMPORT: consumed " + extraSemicolons + " extra semicolon(s), cursor now at " + cursor);
+                }
+
+
+                System.err.println("IMPORT: cursor after=" + cursor + ", char='" +
+                        (cursor < source.length() ? source.charAt(cursor) : "EOF") + "'");


Let's move these prints into spec.afterRecipe in the unit test, and not here in the parser.

timtebeek · 2026-04-01T08:38:37Z

We've since seen this parallel alternative fix go in:

Java: Fix parsing of extra semicolons after imports #7138

Thanks again!

Handle extra semicolons after import statements

e778f0c

Enhanced the parser to detect and consume extra semicolons following import statements, storing them in the whitespace for accurate source representation. Added a debug test to verify import parsing and whitespace handling.

github-project-automation Bot added this to OpenRewrite Nov 22, 2025

github-project-automation Bot moved this to In Progress in OpenRewrite Nov 22, 2025

moderne-meeseeks Bot assigned saranshflip Nov 22, 2025

saranshflip marked this pull request as draft November 22, 2025 06:00

saranshflip marked this pull request as ready for review November 22, 2025 06:01

saranshflip marked this pull request as draft November 22, 2025 06:01

saranshflip mentioned this pull request Nov 22, 2025

Parser fails on extraneous semicolon in import statement when followed by another import #6310

Closed

Update comment

c6f951b

Refine comment for clarity in parsing success case.

github-actions Bot reviewed Nov 25, 2025

Comment thread rewrite-java/src/test/java/org/openrewrite/java/MyParserDebug.java Outdated

timtebeek and others added 2 commits December 4, 2025 12:53

Apply suggestions from code review

09d3d15

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Merge branch 'main' into feature

44d445f

timtebeek added bug java parser labels Dec 4, 2025

saranshflip and others added 5 commits December 9, 2025 10:40

Changes made in MyParserDebug file to use RewriteTest

497c251

Expand and enable ImportTest.semicolonAfterPackage

21a4794

Delete MyParserDebug, as the same can be achieved in ImportTest

3842e42

Merge branch 'main' into feature

951ddca

Merge branch 'main' into feature

5559301

timtebeek reviewed Jan 7, 2026

timtebeek closed this Apr 1, 2026

github-project-automation Bot moved this from In Progress to Done in OpenRewrite Apr 1, 2026

saranshflip wants to merge 9 commits into openrewrite:main from saranshflip:feature
