Skip to content

Comments

[SPARK-55493] [SS] Do not mkdirs in streaming checkpoint offset/commit log directory in StateDataSource#54381

Open
liviazhu wants to merge 2 commits intoapache:masterfrom
liviazhu:liviazhu-db/stds-metadatalog-fix
Open

[SPARK-55493] [SS] Do not mkdirs in streaming checkpoint offset/commit log directory in StateDataSource#54381
liviazhu wants to merge 2 commits intoapache:masterfrom
liviazhu:liviazhu-db/stds-metadatalog-fix

Conversation

@liviazhu
Copy link
Contributor

What changes were proposed in this pull request?

Previously, we try to create a new directory for offsets and commits in the checkpoint directory if they don't exist when running StateDataSource. This is because the utility functions are shared with streaming query functionality which needs to create these dirs if they don't exist. This change creates new readOnly modes for the utilities so that datasources do not need to mkdirs.

Why are the changes needed?

Allow usage of StateDataSource on checkpoints that are read-only.

Does this PR introduce any user-facing change?

No

How was this patch tested?

New unit tests

Was this patch authored or co-authored using generative AI tooling?

Generated-by: claude opus 4.6

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant