fix: unignore input_file_name Spark SQL tests for native_datafusion #3458

andygrove · 2026-02-09T19:56:34Z

Summary

Need to rebase once #3446 is merged

Fix fallback logic for input file name metadata
Enable more tests

The native_datafusion scan now correctly falls back to Spark's FileSourceScanExec when metadata columns (like input_file_name) are present, so the 3 input_file_name tests no longer need to be ignored. For ExtractPythonUDFsSuite, the issue was that the test's collect pattern didn't match CometNativeScanExec. Fixed by adding CometNativeScanExec to the collect and dataFilters match blocks. Closes apache#3312 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The previous commit accidentally removed the IgnoreComet.scala file creation from the diff, causing 94 compilation errors when applied to Spark 3.5.8. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

CometScanExec does not populate InputFileBlockHolder (the thread-local that Spark's FileScanRDD sets), so input_file_name(), input_file_block_start(), and input_file_block_length() return empty or default values when Comet replaces the scan. Detect these expressions in the plan and fall back to Spark's FileSourceScanExec. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

andygrove and others added 3 commits February 9, 2026 12:47

fix: restore IgnoreComet.scala in 3.5.8 Spark SQL test diff

188cd86

The previous commit accidentally removed the IgnoreComet.scala file creation from the diff, causing 94 compilation errors when applied to Spark 3.5.8. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

andygrove force-pushed the fix/unignore-input-file-name-tests branch from 432c277 to ab357d1 Compare February 10, 2026 19:10

andygrove marked this pull request as ready for review February 10, 2026 20:46

andygrove requested a review from mbutrovich February 10, 2026 20:47

andygrove marked this pull request as draft February 10, 2026 20:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: unignore input_file_name Spark SQL tests for native_datafusion #3458

fix: unignore input_file_name Spark SQL tests for native_datafusion #3458

andygrove commented Feb 9, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fix: unignore input_file_name Spark SQL tests for native_datafusion #3458

Are you sure you want to change the base?

fix: unignore input_file_name Spark SQL tests for native_datafusion #3458

Conversation

andygrove commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

andygrove commented Feb 9, 2026 •

edited

Loading