Disable chunk uploads for the sync rpm upload endpoint to improve performance on console by YasenT · Pull Request #1306 · pulp/pulp-cli

YasenT · 2026-02-03T09:25:29Z

Disable chunk uploads
And retrigger tests to see if it fails at same place. As I see that the nighties run just fine

pulp-glue/pulp_glue/rpm/context.py

YasenT · 2026-02-06T12:28:29Z

@mdellweg can you re-run the tests? It's failing again on a random place. I can't do it myself
And i guess we need them to pass before merging?

jobselko · 2026-02-06T12:43:48Z

@YasenT I just re-ran the tests. Could you please add a changelog entry, since this changes the functionality?

mdellweg · 2026-02-06T13:21:20Z

Yes, we should have the tests pass. But it's only necessary once we have established there's no pending change anymore. We are working on the random test failures.

YasenT · 2026-02-06T13:54:51Z

@YasenT I just re-ran the tests. Could you please add a changelog entry, since this changes the functionality?

Added a changelog

YasenT · 2026-02-06T13:55:42Z

Yes, we should have the tests pass. But it's only necessary once we have established there's no pending change anymore. We are working on the random test failures.

Anything else you would like changed? :)

jobselko · 2026-02-09T08:39:25Z

@YasenT Any reason why Gerrod is marked as the commit author? I am guessing this was by accident.

mdellweg · 2026-02-09T08:44:14Z

Before I will comment more about the excessive comments still left, let me try to understand what this change is all about. When you have a huge file, it always used to be chunked up. Now with this change it will always not be chunked up, thereby running into the exact server limits the chunking was invented for in the first place. And for that you take away the user control over the chunk size. Couldn't you instead just tell the user to increase the chunk size? Isn't this a docs issue in the first place? Do we need a globally configurable chunk_size default?

YasenT · 2026-02-09T10:55:29Z

@mdellweg
Actually I like having comments, I believe that they help, not sure if this changes anything in code, it's more of a personal viewpoint. But can we move forward with this? Not sure if every comment needs to be reviewed, but if it needs be, can I get the review so that we can complete these?

As for the chunking -> the default behavior actually doesn't change. It will still chunk at default 1MB, and people can set the chunk size if they want to.
There's a new synchronous endpoint in pulp_rpm, which is being called only when someone is trying to upload an RPM without providing a repository. I introduced the use of this endpoint in the cli a few months ago for this specific case. And we don't want chunking for that specific endpoint, to reduce the overhead coming from it.
And this is what the commit is targeting in fact.

mdellweg

My last comment wasn't even about the comments, but about the fact that the command has a chunk_size (probably with a badly chosen default) that your changes just silently start to ignore. My point is that even in the face of the upload endpoint, there is a limit to the size of file you can upload in one go.
So my specific question here is, should we make the chunk_size globally configurable?

YasenT · 2026-02-10T09:52:12Z

My last comment wasn't even about the comments, but about the fact that the command has a chunk_size (probably with a badly chosen default) that your changes just silently start to ignore. My point is that even in the face of the upload endpoint, there is a limit to the size of file you can upload in one go. So my specific question here is, should we make the chunk_size globally configurable?

On the chunk size, I think that the default value should be dynamic and based off the size of the file being uploaded.

But my changes aren't affecting the default behavior/flow that users are used to with the async endpoint. The changes affect just the synchronous one, which wasn't available through the pulp-cli till recently. And there the target is to reduce the tasking/load on the server side.
We increased the timeouts (on console) so that they won't be a limiting factor for the duration of an upload, which was the main factor limiting how big of a file you can upload without chunking it
And it is faster to upload 1x big file than a bunch of chunks, on tcp/ip level, as you don't force it (tcp) to go into ramp-up(slow-start) phase for each chunk. Our chunks upload sequentially, so they actually slow you down.

mdellweg · 2026-02-12T09:13:06Z

The chunksize is there to reflect a limitation of the server, and therefore no, it does not depend on the file, but on the server you upload to. Your change trades possible failure for some performance improvements in a way that leaves the user no way to react. So I come back to my original question: What is a good default chunksize and should we make it a per server setting?

YasenT · 2026-02-12T14:30:53Z

@mdellweg the chunk uploads are causing issues as is. Which is actually turning it into a guess game for the end user - is it enough to set 20M, 60M, etc.
Would it be okay for you, if for this endpoint chunking is disabled by default except if the user doesn't specify a size? Which in general feels the more "natural" flow. Enable chunking only if i need it, instead of using it to go around the "defaults"

mdellweg · 2026-02-12T14:54:28Z

@mdellweg the chunk uploads are causing issues as is. Which is actually turning it into a guess game for the end user - is it enough to set 20M, 60M, etc.

Yes, sadly it's a guessing game if the user does not have access to the very nginx/apache configuration. Certainly they cannot simply adjust that setting, therefore we cannot just skip chunking for infinitely large files.

Would it be okay for you, if for this endpoint chunking is disabled by default except if the user doesn't specify a size? Which in general feels the more "natural" flow. Enable chunking only if i need it, instead of using it to go around the "defaults"

This is about what is the best solution for most users that at the same time does not prevent any user from uploading a file that they could upload before. Also it's not about the api-endpoint. The user of this library is deliberately abstracted away from the api. That's why there is some heuristic to determine how to accomplish the task promised to the user.

So if I understand your suggestion correctly, we would allow the chunk size to be unspecified (reading that as infinity), and instead of finding a proper default value drop it completely.
I would then still add a way to configure the chunksize (not just for RPM) in each server profile so when hitting the limit once the user can use the experience to persist that parameter in the settings and forget about it.

mdellweg · 2026-02-17T15:53:41Z

How about we do this:

So if I understand your suggestion correctly, we would allow the chunk size to be unspecified (reading that as infinity), and instead of finding a proper default value drop it completely.
And may i add make unspecified the default.
I think the ridiculously small value is a result of actually wanting to trigger that codepath in a not too expensive test scenario. But since we learned that there is not the one right answer, we should not claim to have the proper default.

I can take adding the settings aspect of this later.

YasenT · 2026-02-18T08:20:18Z

@mdellweg I hope I understood you correctly, and made the change global.
Initially went with sys.maxsize but this caused some errors, so right now i'm going with a lower value.

mdellweg

Close enough. Thank you!

github-actions bot added no-changelog no-issue labels Feb 3, 2026

YasenT changed the title ~~Update context.py~~ Disable chunk uploads for the sync rpm upload endpoint to improve performance on console Feb 3, 2026

mdellweg reviewed Feb 5, 2026

View reviewed changes

pulp-glue/pulp_glue/rpm/context.py Outdated Show resolved Hide resolved

github-actions bot added the multi-commit label Feb 6, 2026

YasenT force-pushed the disable-chunks branch from 57cc4fa to a2e03d3 Compare February 6, 2026 11:53

github-actions bot removed the multi-commit label Feb 6, 2026

YasenT force-pushed the disable-chunks branch from a2e03d3 to 683be58 Compare February 6, 2026 13:41

github-actions bot added multi-commit and removed no-changelog labels Feb 6, 2026

YasenT force-pushed the disable-chunks branch from 683be58 to a1361cb Compare February 6, 2026 13:54

github-actions bot removed the multi-commit label Feb 6, 2026

YasenT requested a review from mdellweg February 6, 2026 15:05

YasenT force-pushed the disable-chunks branch from a1361cb to 2f33fcc Compare February 6, 2026 15:26

github-actions bot added the no-changelog label Feb 6, 2026

YasenT force-pushed the disable-chunks branch from 2f33fcc to 7fe4eb4 Compare February 6, 2026 15:30

github-actions bot removed the no-changelog label Feb 6, 2026

YasenT force-pushed the disable-chunks branch 2 times, most recently from e148c8d to b7cf6f4 Compare February 9, 2026 12:53

jobselko previously approved these changes Feb 9, 2026

View reviewed changes

mdellweg requested changes Feb 9, 2026

View reviewed changes

YasenT requested a review from mdellweg February 10, 2026 10:20

YasenT dismissed jobselko’s stale review via 826fd15 February 17, 2026 20:40

github-actions bot added multi-commit no-changelog labels Feb 17, 2026

YasenT force-pushed the disable-chunks branch from 01f5aa1 to 5e108f1 Compare February 17, 2026 20:58

Change the default chunk size to infinite

86cf084

YasenT force-pushed the disable-chunks branch from 5e108f1 to 86cf084 Compare February 17, 2026 21:14

github-actions bot removed multi-commit no-changelog labels Feb 17, 2026

max.size triggers cpythern error

4b7c34a

github-actions bot added multi-commit no-changelog labels Feb 17, 2026

mdellweg approved these changes Feb 18, 2026

View reviewed changes

mdellweg merged commit 1cc25cd into pulp:main Feb 18, 2026
18 checks passed

Conversation

YasenT commented Feb 3, 2026

Uh oh!

Uh oh!

YasenT commented Feb 6, 2026

Uh oh!

jobselko commented Feb 6, 2026

Uh oh!

mdellweg commented Feb 6, 2026

Uh oh!

YasenT commented Feb 6, 2026

Uh oh!

YasenT commented Feb 6, 2026

Uh oh!

jobselko commented Feb 9, 2026

Uh oh!

mdellweg commented Feb 9, 2026

Uh oh!

YasenT commented Feb 9, 2026

Uh oh!

mdellweg left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

YasenT commented Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mdellweg commented Feb 12, 2026

Uh oh!

YasenT commented Feb 12, 2026

Uh oh!

mdellweg commented Feb 12, 2026

Uh oh!

mdellweg commented Feb 17, 2026

Uh oh!

YasenT commented Feb 18, 2026

Uh oh!

mdellweg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

mdellweg left a comment •

edited

Loading

YasenT commented Feb 10, 2026 •

edited

Loading