Skip to content

Commit 019cd07

Browse files
yuantailinggovind-ramnarayan
authored andcommitted
[None][fix] Fix is_post_quant_all2all_supported for MNNVL (NVIDIA#8355)
Signed-off-by: Tailing Yuan <yuantailing@gmail.com>
1 parent 813739a commit 019cd07

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tensorrt_llm/_torch/modules/fused_moe/fused_moe_wide_ep.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -445,7 +445,7 @@ def is_post_quant_all2all_supported(self):
445445
if not self.use_postquant_alltoall:
446446
return False
447447
if self.alltoall_method_type == AlltoallMethodType.MNNVL:
448-
return False
448+
return True
449449
elif self.alltoall_method_type == AlltoallMethodType.DeepEP:
450450
return self.has_nvfp4
451451
elif self.alltoall_method_type == AlltoallMethodType.DeepEPLowLatency:

0 commit comments

Comments
 (0)