Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

docs: improve code comments module: documentation
#1712 opened Jul 26, 2025 by lorinlee Loading…
Fix the typo in readme module: documentation
#1695 opened Jul 17, 2025 by DNXie Loading…
Update pretrain_mamba.py bug Something isn't working module: documentation
#1682 opened Jul 11, 2025 by vignesh1507 Loading…
Issue 1672 fix: initializing the current pointed with int64 to avoid … bug Something isn't working
#1673 opened Jul 7, 2025 by sharanmayank Loading…
Support 1f1b a2a overlap module: distributed
#1671 opened Jul 7, 2025 by lhb8125 Loading…
moe: remove unused variable scale_up module: moe
#1670 opened Jul 6, 2025 by WineChord Loading…
Update README.md module: documentation
#1660 opened Jul 2, 2025 by 21jun Loading…
Fix log-timer-to-tensorboard on logging module: debugging
#1631 opened Jun 13, 2025 by wplf Loading…
Set weights_only=False in optimizer module: optimizer
#1618 opened Jun 9, 2025 by zhic-mt Loading…
ProTip! Exclude everything labeled bug with -label:bug.