Skip to content

fix: Sync Nova hosting configs#5664

Merged
nargokul merged 1 commit intoaws:masterfrom
nargokul:nova-config-updates
Mar 23, 2026
Merged

fix: Sync Nova hosting configs#5664
nargokul merged 1 commit intoaws:masterfrom
nargokul:nova-config-updates

Conversation

@nargokul
Copy link
Contributor

Align _NOVA_HOSTING_CONFIGS CONTEXT_LENGTH and MAX_CONCURRENCY values with ALLOWLISTED_CONFIGURATIONS from https://tiny.amazon.com/f7q419f0

Key changes:

  • micro: correct context/concurrency for g5, g6 instances; add g6e types
  • lite: add g6.12xlarge, g6.24xlarge; fix p5 to 128000 context
  • pro: remove unsupported g6.48xlarge; fix p5 to 24000/1
  • lite-v2: add g6.48xlarge; fix p5 to 128000 context

Align _NOVA_HOSTING_CONFIGS CONTEXT_LENGTH and MAX_CONCURRENCY values
with ALLOWLISTED_CONFIGURATIONS from AGISageMakerInference constants.py.

Key changes:
- micro: correct context/concurrency for g5, g6 instances; add g6e types
- lite: add g6.12xlarge, g6.24xlarge; fix p5 to 128000 context
- pro: remove unsupported g6.48xlarge; fix p5 to 24000/1
- lite-v2: add g6.48xlarge; fix p5 to 128000 context
@nargokul nargokul changed the title fix: Sync Nova hosting configs with AGISageMakerInference fix: Sync Nova hosting configs Mar 23, 2026
@nargokul nargokul merged commit 2e95bc0 into aws:master Mar 23, 2026
15 of 17 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants