Add GPU fallback support via unified compute field by Hkhan161 · Pull Request #65 · CerebriumAI/cerebrium

Hkhan161 · 2026-04-12T17:44:51Z

Summary

compute now accepts either a string or array in the TOML:

compute = "HOPPER_H100"                                       # single GPU
compute = ["HOPPER_H100", "HOPPER_H200", "AMPERE_A100_80GB"]  # with fallbacks

First element is the primary GPU, rest are fallbacks. The CLI normalizes both forms and sends compute as an array (or string) to the backend API. No separate compute_fallbacks field exposed to users.

Changes (3 files)

config.go: ComputeRaw interface{} for TOML input, Compute *string unchanged, added ComputeFallbacks []string. Payload sends array when fallbacks present.
loader.go: normalizeCompute() splits string/array into Compute + ComputeFallbacks
validator.go: Validates primary and all fallbacks against the compute enum

No changes to

deploy.go, run.go, deploy_test.go — Compute stays *string, all existing code untouched

Backward compatible

compute = "H100" (string) works exactly as before
Omitting fallbacks changes nothing

Companion backend PR: CerebriumAI/dashboard-backend#3432

Test plan

All tests pass (go test ./...)
Deployed to dev with array syntax, verified ksvc manifest
Deployed to dev with string syntax, verified backward compat
Deployed to dev with no tier/no fallbacks, verified default behavior

elijah-rou

Yeah same comment as backend, another param seems bloaty here

compute now accepts either a string or array in the TOML: compute = "HOPPER_H100" compute = ["HOPPER_H100", "HOPPER_H200", "AMPERE_A100_80GB"] First element is primary, rest are fallbacks. CLI normalizes both forms and sends compute as array to the backend API. Changes: - config.go: ComputeRaw (interface{}) for TOML, Compute (*string) unchanged - loader.go: normalizeCompute() splits string/array into Compute + ComputeFallbacks - validator.go: validates primary and fallbacks against compute enum No changes to deploy.go, run.go, or tests — Compute stays *string. Companion backend PR: CerebriumAI/dashboard-backend#3432 Made-with: Cursor

elijah-rou reviewed Apr 13, 2026

View reviewed changes

Hkhan161 force-pushed the harris/gpu-fallbacks branch 6 times, most recently from 888b060 to 2a33baf Compare April 15, 2026 01:02

Hkhan161 changed the title ~~Add compute_fallbacks support for GPU fallback scheduling~~ Add GPU fallback support via unified compute field Apr 15, 2026

Hkhan161 force-pushed the harris/gpu-fallbacks branch from 2a33baf to b0155a0 Compare April 15, 2026 01:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add GPU fallback support via unified compute field#65

Add GPU fallback support via unified compute field#65
Hkhan161 wants to merge 1 commit intomainfrom
harris/gpu-fallbacks

Hkhan161 commented Apr 12, 2026 •

edited

Loading

Uh oh!

elijah-rou left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Hkhan161 commented Apr 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes (3 files)

No changes to

Backward compatible

Test plan

Uh oh!

elijah-rou left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Hkhan161 commented Apr 12, 2026 •

edited

Loading