Skip to content

Conversation

@yewentao256
Copy link
Member

@yewentao256 yewentao256 commented Aug 25, 2025

Purpose

We don't let deepgemm env to control fp8 linear, this PR fixes the bug

Signed-off-by: yewentao256 <zhyanwentao@126.com>
@yewentao256 yewentao256 added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 25, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces a fix to ensure that the VLLM_USE_DEEP_GEMM environment variable correctly controls the usage of DeepGEMM for FP8 linear layers. The change adds a check for this environment variable in the should_use_deepgemm_for_fp8_linear function, making its behavior consistent with other DeepGEMM-powered components in vLLM. The implementation is correct and effectively addresses the described bug. My review found no issues with the proposed changes.

@mgoin
Copy link
Member

mgoin commented Aug 25, 2025

Should we put it in the base check? I'm not sure where else these functions are called

Signed-off-by: yewentao256 <zhyanwentao@126.com>
@yewentao256
Copy link
Member Author

Should we put it in the base check? I'm not sure where else these functions are called

Fixed, thanks!

@simon-mo simon-mo merged commit 56dcf4e into vllm-project:main Aug 26, 2025
34 of 36 checks passed
tc-mb pushed a commit to tc-mb/vllm that referenced this pull request Aug 27, 2025
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: tc-mb <caitianchi@modelbest.cn>
epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025
Signed-off-by: yewentao256 <zhyanwentao@126.com>
xiao-llm pushed a commit to xiao-llm/vllm that referenced this pull request Aug 28, 2025
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Xiao Yu <xiao.yu@amd.com>
zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025
Signed-off-by: yewentao256 <zhyanwentao@126.com>
zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Sep 3, 2025
Signed-off-by: yewentao256 <zhyanwentao@126.com>
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants