Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix aux loss computation with per-token loss and dynamic-cp
#4302 opened Apr 14, 2026 by xiaoyao0115 Contributor Draft
5 tasks
Inference | Per-block MoE routing storage for prefix caching complexity: medium Final Review PR is in the "final review" stage
#4301 opened Apr 14, 2026 by lmcafee-nvidia Contributor Loading…
2 of 3 tasks
[main] thd support and GDN packed-seq alignment community-request
#4296 opened Apr 14, 2026 by DAISY-gh Contributor Loading…
3 tasks
[fix] fix optimizer community-request Final Review PR is in the "final review" stage
#4294 opened Apr 14, 2026 by pavelgein Loading…
1 of 5 tasks
Cuda graph fix Final Review PR is in the "final review" stage
#4285 opened Apr 13, 2026 by i-riyad Contributor Loading…
5 tasks
ci: add workflow_dispatch support to cicd-main.yml
#4275 opened Apr 13, 2026 by ko3n1g Contributor Draft
3 tasks
fix mfsdp unwrap stuck at MegatronFSDP complexity: low Final Review PR is in the "final review" stage module: megatron-fsdp
#4274 opened Apr 13, 2026 by wplf Member Loading… Core 0.16
[Dev] Support delayed wgrad compute overlap with P2P backward
#4268 opened Apr 13, 2026 by Wohox Contributor Draft
5 tasks
[Main] Fix TE version check for retain_pinned_cpu_buffers in cpu offload complexity: low Final Review PR is in the "final review" stage
#4267 opened Apr 13, 2026 by BestJuly Contributor Loading… Core 0.16
Get device correctly when module returns a dict instead of individual tensor Approved All necessary approvals have been made complexity: low
#4265 opened Apr 13, 2026 by shifangx Contributor Loading…
5 tasks
Factor RL-specific code out of training.py complexity: high
#4264 opened Apr 12, 2026 by tdene Contributor Loading…
5 tasks
Core 0.16
ProTip! What’s not been updated in a month: updated:<2026-03-14.