Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: Support top-p and top-k
#1938 opened Feb 12, 2026 by zhandaz Loading…
2 of 4 tasks
fix: fix mcore sequence packing CI:L1 Run doctests, unit tests, and functional tests
#1937 opened Feb 12, 2026 by yuki-97 Loading…
Fix mcore inference
#1931 opened Feb 12, 2026 by shanmugamr1992 Loading…
4 tasks
fix: Fix device mismatch when DPO runs validation at start with CPU offload (Nemotron MoE) CI:L1 Run doctests, unit tests, and functional tests
#1930 opened Feb 12, 2026 by RayenTian Draft
4 tasks
feat: add draft model support community-request documentation Improvements or additions to documentation needs-follow-up Issue needs follow-up
#1921 opened Feb 10, 2026 by shaunjoshi Draft
4 tasks
refactor: refactor loss function
#1920 opened Feb 10, 2026 by yuki-97 Draft
perf: Fuse sequence packing for loss function
#1904 opened Feb 10, 2026 by nujoug Loading…
chore: bump mcore and mbridge CI:L1 Run doctests, unit tests, and functional tests super-v3
#1902 opened Feb 9, 2026 by yfw Loading…
4 tasks
test: Add script for nemotron test CI:L0 Run doctests and unit tests super-v3
#1901 opened Feb 9, 2026 by guyueh1 Loading…
4 tasks
feat: ProRLv2 - add seq-mask-tis truncated importance sampling type CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1899 opened Feb 9, 2026 by hijkzzz Loading…
feat: skip logprob and reference logprob computation under certain conditions CI:L1 Run doctests, unit tests, and functional tests deepseek Related to deepseek 671b
#1891 opened Feb 6, 2026 by guyueh1 Loading…
4 tasks
feat: Megatron LoRA GRPO w/ Weight Merging
#1889 opened Feb 5, 2026 by vadam5 Loading…
4 tasks
feat: MXFP8 rollout support super-v3
#1887 opened Feb 5, 2026 by guyueh1 Draft
4 tasks
feat: Support build custom flashinfer CI:L2 Run doctests, unit tests, functional tests, and convergence tests documentation Improvements or additions to documentation super-v3
#1886 opened Feb 5, 2026 by guyueh1 Loading…
4 tasks
feat: retry rollout if generation_logprobs contains NaN CI:L2 Run doctests, unit tests, functional tests, and convergence tests super-v3
#1885 opened Feb 5, 2026 by guyueh1 Loading…
4 tasks
feat: Add perfetto tracing for async GRPO training
#1876 opened Feb 4, 2026 by gspschmid Loading…
4 tasks
feat: add worker initialization timing collection CI:L0 Run doctests and unit tests CI:L1 Run doctests, unit tests, and functional tests
#1873 opened Feb 4, 2026 by yashaswikarnati Loading…
4 tasks
chore: bump torch 2.9.1, vllm 0.15 sglang 0.5.8, ray 2.53 dependencies Pull requests that update a dependency file
#1871 opened Feb 3, 2026 by terrykong Loading…
4 tasks
feat: add fault injection utilities for testing fault tolerance CI:L0 Run doctests and unit tests CI:L1 Run doctests, unit tests, and functional tests super-v3
#1868 opened Feb 3, 2026 by yashaswikarnati Loading…
4 tasks
Mdp
#1849 opened Jan 29, 2026 by shanmugamr1992 Loading…
4 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.