Fix some speculative decode tests with tl.dot #17371
vllm-bot merged 2 commits into vllm-project:main
Signed-off-by: Huy Do <huydhn@gmail.com>
This fixes some of the spec decode failures, but not all of them. Specifically, the step at https://github.com/vllm-project/vllm/blob/main/.buildkite/test-pipeline.yaml#L283 has been fixed, but there are more failures from the next line.
Signed-off-by: Huy Do <huydhn@gmail.com> Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>
Signed-off-by: Huy Do <huydhn@gmail.com> Signed-off-by: Yuqi Zhang <yuqizhang@google.com>
I'm seeing these failures in my other PR #16859, but they don't seem to be related to the PyTorch 2.7.0 release. They seem to come from #13305, which runs into the Triton issue triton-lang/triton#2266. I have seen a similar PR in PyTorch about this: pytorch/pytorch#147765.
This PR attempts to fix the failing tests accordingly.