
Pull requests: NVIDIA/Megatron-LM

Pull requests list

Handle SSM sharded tensor merge OOM with CPU fallback [labels: community-request]
#4442 opened Apr 23, 2026 by returnL Contributor
2 of 5 tasks
Added NVFP4 MTP layer support
#4439 opened Apr 23, 2026 by sanandaraj5597 Contributor
docs: fix broken links and anchors across READMEs and docs [labels: docs-only]
#4438 opened Apr 23, 2026 by sbhavani Contributor
1 of 5 tasks
docs: fix Python version requirement and uv install commands in install [labels: docs-only]
#4437 opened Apr 23, 2026 by sbhavani Contributor
5 tasks
get rid of weights_only=False [labels: complexity: low; Expert Review [deprecated]; Run functional tests]
#4434 opened Apr 22, 2026 by dimapihtar Contributor
5 tasks
Core 0.16
fix cudagraph nonconvergence
#4433 opened Apr 22, 2026 by jiemingz Contributor Draft
5 tasks
Reorder mtp_post_process after attn backward in 1F1B schedule plan
#4430 opened Apr 22, 2026 by gdengk Contributor
5 tasks
Adding code for Flextron [labels: complexity: high]
#4429 opened Apr 22, 2026 by sheliang-nv
2 of 5 tasks
Core 0.16
[DO NOT MERGE] Cye/mfsdp devgrad revert
#4428 opened Apr 22, 2026 by cspades Member Draft
5 tasks
Add misc CUDA graph sugar to CudaGraphManager [labels: complexity: low]
#4425 opened Apr 22, 2026 by tdene Contributor
5 tasks
Core 0.16
fix: NVRx async compatibility and defer resiliency import [labels: complexity: medium; core_r0.17.0; Expert Review [deprecated]; Final Review; Run functional tests]
#4420 opened Apr 22, 2026 by sbak5 Contributor
5 tasks
Core 0.16
Unify and refactor Megatron-FSDP documentation. [labels: complexity: medium; Expert Review [deprecated]; module: documentation; module: megatron-fsdp]
#4418 opened Apr 21, 2026 by cspades Member
5 tasks
Core 0.16
Deprecate static inference
#4413 opened Apr 21, 2026 by santhnm2 Contributor Draft
5 tasks