-
Notifications
You must be signed in to change notification settings - Fork 677
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Enhanced Off-Policy Async Rollout with Staleness Control and Partial Rollout Support
#1781
opened Mar 30, 2026 by
huang3eng
Loading…
Add host memory metrics to available_memory function
#1764
opened Mar 25, 2026 by
peterjc123
Loading…
[Fix] Initialize grad_norm before found_inf skip path
#1762
opened Mar 24, 2026 by
kaysonyu
Loading…
[kimi25 rl part4] Support K25 HF weight conversion between BF16\FP8\INT4
#1757
opened Mar 23, 2026 by
Gao016
Loading…
[kimi25 rl part2] pass megatron bridge provider args from slime config
#1754
opened Mar 23, 2026 by
GeLee-Q
Loading…
[kimi25 rl part1.2] support kimi25 q-lora pairing in bridge update path (weight update for train-infer colocate)
#1753
opened Mar 23, 2026 by
GeLee-Q
Loading…
Add Mooncake Backend for Rollout Data Transfer
run-ci-megatron
#1709
opened Mar 11, 2026 by
zxpdemonio
Loading…
6 tasks done
fix: normalize rewards per-group when sample counts are unequal
#1655
opened Mar 2, 2026 by
dubin555
Loading…
2 of 3 tasks
feat: Add knowledge distillation example with offline support
#1654
opened Mar 2, 2026 by
tourzhao
Loading…
3 tasks
Fix the Rotary Position Embedding (RoPE) parameter passing in the GLM5 mode
#1650
opened Mar 2, 2026 by
hanxdmech-ship-it
Loading…
[WIP] fix transforrmers api change at 5.2.0
run-ci-megatron
#1647
opened Feb 28, 2026 by
UbeCc
Loading…
fix(r3,vlm): remove orphaned RoutingReplay from decoder rebuild.
#1620
opened Feb 24, 2026 by
yxyOo
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.