-
Notifications
You must be signed in to change notification settings - Fork 3.2k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Dev] add fp4 when get align size in HybridepManager
#2140
opened Nov 5, 2025 by
qijiaxing
Loading…
6 tasks
add device and dtype to empty inv_dt init
bug
Something isn't working
dev branch
Dev branch related issues and development
Expert Review
Apply this label to indicate that your PR is ready for expert review.
feat(moe): Support placing MTP layers into standalone stages
Expert Review
Apply this label to indicate that your PR is ready for expert review.
module: moe
Add CP + Sequence Packing support for Mimo
#2135
opened Nov 4, 2025 by
mehraakash
Loading…
5 of 6 tasks
[Dev] Add more tests for LayerwiseDistOpt with dist_ckpt
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Run tests
Improve ModelOpt paths & add more Nemotron/hybrid model support
#2131
opened Nov 4, 2025 by
jenchen13
Loading…
1 of 6 tasks
Fix runaway Etpt in straggler detector by resetting FLOPs accumulator
#2128
opened Nov 4, 2025 by
sbhavani
Loading…
[Dev] Remove calculation of padding token in moe routing loss
dev branch
Dev branch related issues and development
module: moe
remove training dependency from megatron core for fsdp checkpoint with EP
core_r0.15.0
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Tensorize dynamic inference mixed sampling
Expert Review
Apply this label to indicate that your PR is ready for expert review.
Run functional tests
Trains for 50-100 steps and tests against golden values
Run tests
multi thread read full parallel save ckpt
#2104
opened Nov 3, 2025 by
861482002
Loading…
6 tasks done
Add router replay for MoE models
module: moe
#2101
opened Nov 3, 2025 by
litianjian
Loading…
6 tasks
ci: Run functional tests
Run functional tests
Trains for 50-100 steps and tests against golden values
[Dev] Remove redundant reduce in aux_loss logging
Expert Review
Apply this label to indicate that your PR is ready for expert review.
module: moe
Previous Next
ProTip!
Follow long discussions with comments:>50.