-
Notifications
You must be signed in to change notification settings - Fork 838
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(rollout): support non extra gpu placement when using rollout-external mode
#1997
opened May 30, 2026 by
shinytang6
Loading…
fix(logging): partition raw rewards for correct samples
#1996
opened May 30, 2026 by
Jiang020609
Loading…
[sft] rebuild the sft loss mask generator and add ci
#1994
opened May 30, 2026 by
zhuzilin
Contributor
Loading…
Add timeout configuration for on policy distillation HTTP session.
#1970
opened May 28, 2026 by
qqwqqw689
Contributor
Loading…
fix:TorchMemorySaver observes invalid LD_PRELOAD. when add --disable-weights-backuper
#1937
opened May 22, 2026 by
zyfzjsc988
Loading…
feat: add SFT entropy logging and validation loss monitoring
#1925
opened May 19, 2026 by
none0663
Contributor
Loading…
fix(debug): auto-append rollout_id/rank in save_debug_train_data path template
#1922
opened May 19, 2026 by
wlf-darkmatter
Loading…
Fix RolloutManager reward normalization for uneven rollout groups
#1918
opened May 18, 2026 by
haoyang9804
Loading…
feat: add --max-checkpoint-count to limit saved checkpoints
#1914
opened May 16, 2026 by
JIANG54864
Loading…
fix: use getattr for sglang_speculative_algorithm to avoid AttributeError
#1913
opened May 15, 2026 by
none0663
Contributor
Loading…
Support custom rollout-proxy TIS hooks in bypass mode
#1912
opened May 15, 2026 by
sjtushenhai
Loading…
fix: add eval-before-train to train_async.py (parity with train.py)
#1906
opened May 13, 2026 by
Taosheng-ty
Loading…
4 tasks done
feat: filter logits by loss_mask before log_probs/entropy computation
#1905
opened May 13, 2026 by
Taosheng-ty
Loading…
5 of 6 tasks
fix: preserve fused 3D expert tensors for Qwen3.5 MoE in torch_dist→H…
#1904
opened May 12, 2026 by
rouchenzi
Loading…
fix: restore actor weights after loading OPD teacher checkpoint
#1903
opened May 12, 2026 by
canlin03
Loading…
Filter zero-advantage samples in convert_samples_to_train_data
#1901
opened May 11, 2026 by
nanjiangwill
Collaborator
Loading…
fix: add fallback for --save-hf when Megatron-Bridge lacks model support
#1881
opened Apr 30, 2026 by
WangHong-yang
Contributor
Loading…
3 tasks done
feat(profile): safer torch.profiler defaults + per-grad-step capture
#1879
opened Apr 29, 2026 by
leofan-lab
Contributor
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-04-30.