forked from kmccleary3301/nested_learning
Open
Labels
- enhancement: New feature or request
- execution-board: Execution board ticket set for paper alignment
- phase-4: Phase 4: distributed faithful path parity
- quality-gate: Has explicit acceptance criteria and test gates
Description
Purpose
Bring distributed training (starting with FSDP) to paper-faithful parity with single-GPU online/per-layer behavior.
Mandatory Reading (blocking)
First comment must summarize:
- reports/NL_IMPLEMENTATION_ORACLE.md sections 5.3 and 6.1.4
- docs/PAPER_COMPLIANCE.md distributed caveats
- train_fsdp.py
- src/nested_learning/training.py distributed guards and online loop
Required Code Anchors
- train_fsdp.py
- train_dist.py
- src/nested_learning/training.py
- parity tests in tests/
Scope
- Add FSDP support for:
  - online chunk updates
  - per-layer teach signals
- Keep fail-fast behavior explicit for unsupported combinations.
- Add single-GPU vs FSDP parity harness.
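The fail-fast requirement in the scope above could take roughly the following shape. This is only a sketch under stated assumptions: the strategy names, the feature names, the `EXAMPLE_SUPPORT` table, and `require_supported` are all hypothetical and do not reflect what train_fsdp.py or src/nested_learning/training.py actually contain. The point is that an unsupported combination is rejected before any training step runs, with a message naming the missing feature.

```python
from typing import Dict, FrozenSet, Iterable


class UnsupportedCombinationError(ValueError):
    """Raised before training starts when a strategy cannot honor a requested feature."""


# Hypothetical support table: strategy name -> features it implements.
# The entries are placeholders for illustration, not the repository's real state.
EXAMPLE_SUPPORT: Dict[str, FrozenSet[str]] = {
    "single_gpu": frozenset({"online_chunk_updates", "per_layer_teach"}),
    "fsdp": frozenset({"online_chunk_updates"}),
}


def require_supported(
    strategy: str,
    requested: Iterable[str],
    support: Dict[str, FrozenSet[str]] = EXAMPLE_SUPPORT,
) -> None:
    """Fail fast if `strategy` lacks any feature in `requested`."""
    if strategy not in support:
        raise UnsupportedCombinationError(f"unknown strategy {strategy!r}")
    # Sort so the error message is deterministic and easy to assert on in tests.
    missing = sorted(set(requested) - support[strategy])
    if missing:
        raise UnsupportedCombinationError(
            f"strategy {strategy!r} does not support: {', '.join(missing)}"
        )
```

Keeping the table as data rather than scattered `if` statements means that adding FSDP support for a feature is a one-line change, and the guard stays explicit for whatever remains unsupported.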
Deliverables
- Updated FSDP path.
- Parity test script + report template.
Acceptance Criteria
- 1k-step FSDP faithful run completes.
- Parity drift against single-GPU baseline is within defined tolerance.
- First issue comment contains mandatory reading summary.
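The parity criterion above could be checked with something like the sketch below, which compares per-step losses logged by a single-GPU baseline and an FSDP run. Everything here is an assumption made for illustration: the issue does not define the tolerance, so the `atol` and `rtol` defaults are placeholders, and `compare_loss_curves` and `ParityReport` are hypothetical names rather than existing helpers in the repository. Comparing loss is also only one possible drift metric; parameter or gradient norms may be needed as well.

```python
import math
from typing import NamedTuple, Sequence


class ParityReport(NamedTuple):
    steps: int
    max_abs_drift: float
    worst_step: int
    within_tolerance: bool


def compare_loss_curves(
    baseline: Sequence[float],
    candidate: Sequence[float],
    atol: float = 1e-3,  # placeholder tolerance, not taken from the issue
    rtol: float = 1e-3,  # placeholder tolerance, not taken from the issue
) -> ParityReport:
    """Compare two per-step loss logs and report the largest absolute drift."""
    if len(baseline) != len(candidate):
        raise ValueError("runs must log the same number of steps")
    if not baseline:
        raise ValueError("loss logs are empty")

    max_drift, worst_step, ok = 0.0, 0, True
    for step, (b, c) in enumerate(zip(baseline, candidate)):
        if not (math.isfinite(b) and math.isfinite(c)):
            raise ValueError(f"non-finite loss at step {step}")
        drift = abs(b - c)
        if drift > max_drift:
            max_drift, worst_step = drift, step
        # Same acceptance form as math.isclose-style checks: atol + rtol * |baseline|.
        if drift > atol + rtol * abs(b):
            ok = False
    return ParityReport(len(baseline), max_drift, worst_step, ok)
```

The returned fields map directly onto a report template: number of steps compared, worst drift, the step where it occurred, and a pass/fail flag for the gate.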