-
Notifications
You must be signed in to change notification settings - Fork 255
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[3.1/4] Diffusion Quantized ckpt export - WAN 2.2 14B
#855
opened Feb 5, 2026 by
jingyu-ml
Loading…
3 of 4 tasks
OMNIML-2663] Replace modelopt FP8 QDQ nodes with native ONNX QDQ nodes
#852
opened Feb 4, 2026 by
ajrasane
Loading…
Track global_amax for weight FP4 MSE sweep; Refactor to NVFP4StaticQantizer, NVFP4MSECalibrator
#849
opened Feb 3, 2026 by
realAsma
Loading…
GPTQ[1/N]: Implement layer activation getter
#840
opened Feb 2, 2026 by
sugunav14
Loading…
1 task done
Fix TEGroupedLinear quantization for expert parallelism (EP > 1)
#833
opened Jan 30, 2026 by
yueshen2016
Loading…
chore(onnx): Remove dead code in find_scales and add test coverage
#832
opened Jan 30, 2026 by
shantoislamdev
Loading…
[Minor] fix: do not requantize the scales in FP8 scale sweep calibration
#825
opened Jan 28, 2026 by
Fridah-nv
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.