Skip to content

Conversation

@kosiew
Copy link
Owner

@kosiew kosiew commented Jun 23, 2025

Which issue does this PR close?

  • Closes # (tracking support for fine-grained Parquet writer configuration)

Rationale for this change

Allow DataFrame.write_parquet to accept either compression arguments or a ParquetWriterOptions instance, simplifying the API.

What changes are included in this PR?

  • Introduced ParquetWriterOptions and ParquetColumnOptions dataclasses
  • Added write_parquet_with_options method and autodetection logic in write_parquet
  • Exported the new classes from the package
  • Added unit tests for the new behaviour

Are these changes tested?

Yes, new tests added under tests/test_dataframe.py.

Are there any user-facing changes?

write_parquet now accepts a ParquetWriterOptions instance as the second argument.


https://chatgpt.com/codex/tasks/task_e_685938ed416083248e907180e6358d76

@kosiew kosiew marked this pull request as ready for review June 23, 2025 12:14
@kosiew kosiew merged commit 280b65b into parquet-1162 Jun 23, 2025
0 of 2 checks passed
@kosiew kosiew deleted the codex/overload-write_parquet-to-auto-detect-options branch July 16, 2025 04:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants