[SPARK-54657][PS] Refactor pyspark.sql.pandas.serializers for readability/reuse #54406

Closed
devin-petersohn wants to merge 1 commit into apache:master from devin-petersohn:devin/refactor_pandas_serializers

Conversation

@devin-petersohn
Contributor

What changes were proposed in this pull request?

Refactor the large pyspark.sql.pandas.serializers module to break it up and reuse similar logic.

Why are the changes needed?

To break up the module and reuse similar logic, which improves readability.

Does this PR introduce any user-facing change?

No

How was this patch tested?

CI

Was this patch authored or co-authored using generative AI tooling?

Co-authored-by: Claude Opus 4.6

…lity/reuse

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>
Co-authored-by: Devin Petersohn <devin.petersohn@snowflake.com>
@gaogaotiantian
Contributor

Ah. You don't have to spend your time on this part for now. @Yicong-Huang is actively working on refactoring serializers and we probably don't want conflicts at this point.

@Yicong-Huang
Contributor

Yicong-Huang commented Feb 21, 2026

Thanks @devin-petersohn.

We are actively working on refactoring the serializers, but decided to do it surgically and slowly to make sure we do not introduce breaking changes or regressions. The goal is to reduce the number of serializers by extracting transformers and moving data transformation logic out of the serializers. You can follow the two umbrella tickets, https://issues.apache.org/jira/browse/SPARK-55388 and https://issues.apache.org/jira/browse/SPARK-55384, for the process.
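
For readers following along, here is a rough sketch of the general idea of separating transformers from serializers. It is not PySpark code and does not reflect the actual design in the tickets above: every name in it (`LineSerializer`, `compose`, `drop_empty`, `cast_ints_to_float`, `roundtrip`) is hypothetical, and plain lists of dicts stand in for Arrow batches. It assumes the serializer should own only the wire format, while data transformations are small composable functions applied before writing.

```python
import io
import json
from typing import Callable, Iterable, Iterator, List, TextIO

# Stand-in for a record batch; a real implementation would use Arrow batches.
Batch = List[dict]
Transformer = Callable[[Iterator[Batch]], Iterator[Batch]]


def drop_empty(batches: Iterator[Batch]) -> Iterator[Batch]:
    """Hypothetical transformer: skip batches that contain no rows."""
    for batch in batches:
        if batch:
            yield batch


def cast_ints_to_float(batches: Iterator[Batch]) -> Iterator[Batch]:
    """Hypothetical transformer: convert int values (but not bools) to float."""
    for batch in batches:
        yield [
            {
                k: float(v) if isinstance(v, int) and not isinstance(v, bool) else v
                for k, v in row.items()
            }
            for row in batch
        ]


def compose(*steps: Transformer) -> Transformer:
    """Chain transformers left to right into a single transformer."""
    def run(batches: Iterator[Batch]) -> Iterator[Batch]:
        for step in steps:
            batches = step(batches)
        return batches
    return run


class LineSerializer:
    """Hypothetical serializer that knows only the wire format:
    one JSON-encoded batch per line, with no data transformation logic."""

    def dump_stream(self, batches: Iterable[Batch], stream: TextIO) -> None:
        for batch in batches:
            stream.write(json.dumps(batch) + "\n")

    def load_stream(self, stream: TextIO) -> Iterator[Batch]:
        for line in stream:
            yield json.loads(line)


def roundtrip(batches: Iterable[Batch], transform: Transformer) -> List[Batch]:
    """Apply the transformers, write with the serializer, then read back."""
    serializer = LineSerializer()
    buffer = io.StringIO()
    serializer.dump_stream(transform(iter(batches)), buffer)
    buffer.seek(0)
    return list(serializer.load_stream(buffer))
```

Under these assumptions, a new behavior is a new transformer passed to `compose(...)` rather than a new serializer subclass, which is one way the number of serializers could shrink.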

@devin-petersohn
Contributor Author

No problem! I should have made a comment on the JIRA ticket to check that this one wasn't already in progress. Your approach makes sense, but if any of this is useful at all please feel free to use any of the code here.

3 participants