forked from kmccleary3301/nested_learning
Open
Labels
- enhancement: New feature or request
- execution-board: Execution board ticket set for paper alignment
- phase-1: Phase 1: optimizer equation fidelity (Eq. 21-24)
- quality-gate: Has explicit acceptance criteria and test gates
Description
Purpose
Implement a Newton-Schulz nonlinearity path for inner momentum updates so that they align with the Eq. 24-style formulation.
Mandatory Reading (blocking)
First comment must summarize:
- `reports/NL_IMPLEMENTATION_ORACLE.md` section 6.1.1 and optimizer gap notes
- `reports/paper/NL-print.extracted.clean.txt` Eq. (24)
- `src/nested_learning/optim/m3.py` Newton-Schulz implementation
Required Code Anchors
- `src/nested_learning/optim/deep.py`
- `src/nested_learning/optim/m3.py`
- `src/nested_learning/optim/factory.py`
Scope
- Add inner variant `muon_ns` using Newton-Schulz output transform.
- Clarify difference between outer Muon optimizer and inner `muon_ns` memory rule in docs.
- Keep backward compatibility with current `muon` configs.
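
As a rough illustration of the first scope item, the sketch below applies a Newton-Schulz transform to the output of an inner momentum update. It is not the code in `src/nested_learning/optim/m3.py`: the names `newton_schulz` and `inner_momentum_step`, the `beta`/`lr` parameterization, and the use of NumPy are assumptions made for the example, and it uses the classic cubic iteration, which may differ from the coefficients the repository uses.

```python
import numpy as np


def newton_schulz(m: np.ndarray, steps: int = 10, eps: float = 1e-7) -> np.ndarray:
    """Approximately orthogonalize a 2-D matrix with cubic Newton-Schulz steps.

    Assumption: classic iteration X <- 1.5 X - 0.5 X X^T X, not necessarily
    the polynomial used by the repository's own implementation.
    """
    if m.ndim != 2:
        raise ValueError("expected a 2-D matrix")
    # Work in the wide orientation so x @ x.T is the smaller Gram matrix.
    transposed = m.shape[0] > m.shape[1]
    x = m.T if transposed else m
    # Frobenius normalization bounds every singular value by 1, which lies
    # inside the convergence region (0, sqrt(3)) of the cubic iteration.
    x = x / (np.linalg.norm(x) + eps)
    for _ in range(steps):
        x = 1.5 * x - 0.5 * (x @ x.T) @ x
    return x.T if transposed else x


def inner_momentum_step(momentum, grad, beta=0.9, lr=0.1, transform=None):
    """Hypothetical inner memory rule: update momentum, then transform its output.

    With transform=None the raw momentum is returned as the update; passing
    newton_schulz illustrates what a `muon_ns`-style variant might do.
    """
    momentum = beta * momentum - lr * grad
    update = momentum if transform is None else transform(momentum)
    return momentum, update


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    grad = rng.standard_normal((4, 6))
    state = np.zeros((4, 6))
    state, update = inner_momentum_step(state, grad, transform=newton_schulz)
    # Rows of the transformed update are close to orthonormal after enough steps.
    print(np.round(update @ update.T, 3))
```

Keeping the transform as an optional argument is one way to leave an existing code path untouched when the new variant is not selected; whether the repository's factory wires variants this way is not stated in this ticket.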
Test Requirements
- Unit tests for NS path shape/stability.
- Deterministic toy-case checks.
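
The two test requirements above could look roughly like the following pytest-style checks. `ns_reference` is a hypothetical stand-in (a plain cubic Newton-Schulz iteration in NumPy) included only so the snippet runs on its own; real tests would target the repository's implementation instead. The toy case uses a matrix whose orthogonal polar factor is known in closed form.

```python
import numpy as np


def ns_reference(m, steps=20, eps=1e-7):
    # Hypothetical stand-in for the function under test (cubic Newton-Schulz).
    x = m / (np.linalg.norm(m) + eps)
    for _ in range(steps):
        x = 1.5 * x - 0.5 * (x @ x.T) @ x
    return x


def test_shape_is_preserved():
    rng = np.random.default_rng(0)
    for shape in [(3, 5), (5, 3), (4, 4)]:
        assert ns_reference(rng.standard_normal(shape)).shape == shape


def test_output_stays_finite():
    # Stability: zero, tiny, and huge inputs must not produce NaN or inf.
    rng = np.random.default_rng(1)
    base = rng.standard_normal((4, 6))
    for scale in (0.0, 1e-30, 1.0, 1e30):
        assert np.isfinite(ns_reference(scale * base)).all()


def test_toy_case_matches_known_polar_factor():
    # m = Q @ diag(3, 2) with Q a 90-degree rotation, so the result should be Q.
    m = np.array([[0.0, 2.0], [-3.0, 0.0]])
    expected = np.array([[0.0, 1.0], [-1.0, 0.0]])
    assert np.allclose(ns_reference(m), expected, atol=1e-6)


def test_is_deterministic():
    m = np.arange(6, dtype=float).reshape(2, 3)
    assert np.array_equal(ns_reference(m), ns_reference(m))
```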
Deliverables
- Variant implementation + docs + ablation config.
Acceptance Criteria
- No regression in outer optimizer behavior.
- 5k run finite with expected telemetry keys.
- First issue comment contains mandatory reading summary.
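
The "5k run finite with expected telemetry keys" criterion can be checked mechanically. The helper below is a minimal sketch that assumes telemetry is available as one mapping of numeric values per step; the function name and the key names in the usage example (`loss`, `grad_norm`) are hypothetical, since the ticket does not list the expected keys.

```python
import math
from typing import Iterable, List, Mapping


def find_telemetry_problems(
    records: Iterable[Mapping[str, float]],
    required_keys: Iterable[str],
) -> List[str]:
    """Return a description of every missing or non-finite telemetry value."""
    required = list(required_keys)
    problems = []
    for step, record in enumerate(records):
        for key in required:
            if key not in record:
                problems.append(f"step {step}: missing key {key!r}")
            elif not math.isfinite(record[key]):
                problems.append(f"step {step}: non-finite value for {key!r}")
    return problems


if __name__ == "__main__":
    # Hypothetical two-step log; the second step has a NaN loss.
    log = [
        {"loss": 2.3, "grad_norm": 0.9},
        {"loss": float("nan"), "grad_norm": 1.1},
    ]
    for problem in find_telemetry_problems(log, ["loss", "grad_norm"]):
        print(problem)
```

An empty return value means every step carried all required keys with finite values.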