Pull requests: NVIDIA/Megatron-LM
Handle SSM sharded tensor merge OOM with CPU fallback
community-request
#4442
opened Apr 23, 2026 by
returnL
Contributor
2 of 5 tasks
docs: fix broken links and anchors across READMEs and docs
docs-only
documentation only (docs or docstrings)
#4438
opened Apr 23, 2026 by
sbhavani
Contributor
1 of 5 tasks
docs: fix Python version requirement and uv install commands in install
docs-only
#4437
opened Apr 23, 2026 by
sbhavani
Contributor
5 tasks
get rid of weights_only=False
complexity: low
Expert Review
[deprecated] Apply this label to indicate that your PR is ready for expert review.
Run functional tests
Reorder mtp_post_process after attn backward in 1F1B schedule plan
#4430
opened Apr 22, 2026 by
gdengk
Contributor
5 tasks
[Main] Fix invisible issues related to use_decoupled_grad for Megatron-FSDP.
complexity: low
Final Review
PR is in the "final review" stage
[DEV] fix(megatron-fsdp): compute SWiGLU/GDN split in item coordinates for non-DTensor optimizer states
#4424
opened Apr 22, 2026 by
xuwchen
Contributor
5 tasks
fix(megatron-fsdp): compute SWiGLU/GDN split in item coordinates for non-DTensor optimizer states
complexity: low
#4423
opened Apr 22, 2026 by
xuwchen
Contributor
5 tasks
feat(ckpt): expose validate_access_integrity knob on dist-ckpt load
complexity: low
#4422
opened Apr 22, 2026 by
asolergi-nv
Contributor
5 tasks
fix: NVRx async compatibility and defer resiliency import
complexity: medium
core_r0.17.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
Expert Review
Final Review
Run functional tests
[Inference] Protocol Robustness Improvements in Coordinator (Follow-up to #4176)
community-request
#4419
opened Apr 21, 2026 by
DhineshPonnarasan
Contributor
Unify and refactor Megatron-FSDP documentation.
complexity: medium
Expert Review
module: documentation
module: megatron-fsdp