feat(graph): bf16/fp16 Parameter.AddGradient (unblock bf16 autograd training) by dndungu · Pull Request #153 · zerfoo/ztensor

dndungu · 2026-06-16T23:11:21Z

Summary

Parameter.AddGradient/ClearGradient host gradient-accumulation type switches covered float32/float64/ints but no reduced-precision float case, so bf16 (or fp16) autograd training failed at the first backward with AddGradient unsupported for this numeric type; use engine ops instead. This is the gating blocker for bf16 CrossAsset training (its layer backwards — LayerNorm/Linear/Bias — accumulate grads through this path).

Fix

Add float16.BFloat16 and float16.Float16 cases: accumulate through f32 and round on store (matching how every bf16 op publishes its result). bf16 shares f32's exponent range, so no overflow vs f32.

Tests

TestParameter_AddGradient_BFloat16 / _Float16 (round-trip via the shared helper)
TestParameter_AddGradient_BFloat16_Value — asserts two accumulations sum exactly (0.5+0.25→0.75, 1.0+0.5→1.5, both bf16-exact).
f32/f64/int CPU paths byte-identical (untouched).

General framework fix — any bf16/fp16 consumer benefits, nothing Wolf-specific. Follow-up to v1.13.0 (native bf16 GPU kernels).

The host gradient-accumulation type switch in AddGradient/ClearGradient had float32/float64/ints but no reduced-precision float case, so any bf16 (or fp16) autograd training failed at the first backward with 'AddGradient unsupported for this numeric type'. Add float16.BFloat16 and float16.Float16 cases: accumulate through f32 and round on store (matching how every bf16 op publishes its result). bf16 shares f32's exponent range, so no overflow vs f32. Unblocks bf16 CrossAsset training (its layer backwards accumulate grads here). General framework fix -- any bf16/fp16 consumer benefits; CPU f32/f64/int paths byte-identical.

dndungu merged commit cfa1b45 into main Jun 16, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(graph): bf16/fp16 Parameter.AddGradient (unblock bf16 autograd training)#153

feat(graph): bf16/fp16 Parameter.AddGradient (unblock bf16 autograd training)#153
dndungu merged 1 commit into
mainfrom
feat/bf16-addgradient

dndungu commented Jun 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dndungu commented Jun 16, 2026

Summary

Fix

Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant