-
Notifications
You must be signed in to change notification settings - Fork 3.7k
[fix](parquet) Don't decompress dict page when dict page is empty #60374
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
[fix](parquet) Don't decompress dict page when dict page is empty #60374
Conversation
|
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
4827ed9 to
3051f98
Compare
|
run buildall |
|
@kaka11chen Hello, do you have time to see this fix? |
TPC-H: Total hot run time: 32044 ms |
ClickBench: Total hot run time: 28.34 s |
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
BE Regression && UT Coverage ReportIncrement line coverage Increment coverage report
|
3051f98 to
c93b096
Compare
When a string column's data are all null, the dict page may be empty. The error message is as follows: INTERNAL_ERROR]Read parquet file hdfs://HDFS82742/ydbi/original/server/tlbbgl/auction_zstd/dt=2024-12-13/084cadfc5200b4ad-c2b2568a00000045_1132749056_data.0.parq failed, reason = [INVALID_ARGUMENT]ZSTD_decompressDCtx error: Unknown frame descriptor. cur path: xxx We needn't decompress dcit page data when dict page is empty and just cache empty data as decompressed data.
…ictI32::insert_many_dict_data` when dict data is empty.
c93b096 to
04583b0
Compare
|
run buildall |
TPC-H: Total hot run time: 31846 ms |
ClickBench: Total hot run time: 28.34 s |
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
When a string column's data are all null, the dict page may be empty.
The error message is as follows:
We needn't decompress dcit page data when dict page is empty and just cache empty data as decompressed data.
What problem does this PR solve?
Issue Number: close #xxx
Related PR: #xxx
Problem Summary:
Release note
None
Check List (For Author)
Test
Behavior changed:
Does this need documentation?
Check List (For Reviewer who merge this PR)