Pull requests: vllm-project/vllm

Pull requests list

[Mamba] -Consolidate Mambas Attention Logic
Labels: v1
#28133 opened Nov 5, 2025 by Josephasafg (Draft)
1 of 5 tasks

[CPU] Enable torch profiling
Labels: v1
#28130 opened Nov 5, 2025 by aditew01

[misc] add vLLM Beijing Meetup
Labels: documentation
#28127 opened Nov 5, 2025 by jjzhang

[Chore] Remove Nemotron-Nano-VL config copy
Labels: ready
#28126 opened Nov 5, 2025 by Isotr0py
3 of 5 tasks

[Bugfix] Fix Qwen3-Reranker-8B load
Labels: qwen
#28117 opened Nov 5, 2025 by noooop
5 tasks

[V0 deprecation]clean up is_v1_supported_oracle
Labels: ready, v1
#28116 opened Nov 5, 2025 by wangxiyuan
5 tasks

[V0 deprecation] Deprecate use_v1 parameter
Labels: rocm, tpu
#28112 opened Nov 5, 2025 by wangxiyuan
5 tasks

[Misc] Remove the duplicate code
Labels: frontend, ready
#28111 opened Nov 5, 2025 by chaunceyjiang
5 tasks

[CLI] add --max-tokens to vllm complete
Labels: frontend
#28109 opened Nov 5, 2025 by Iceber
3 of 5 tasks

Use maximum number of batched tokens to autotune MoE
Labels: nvidia
#28106 opened Nov 5, 2025 by nvjullin
5 tasks

[Model] Consolidate Deepseek-MoE implementation with DeepSeek-v2
Labels: deepseek, new-model
#28101 opened Nov 5, 2025 by Isotr0py
3 of 5 tasks

Flashinfer: TopK+TopP sampling from probs
Labels: v1
#28099 opened Nov 5, 2025 by Kh4L

[Kernel] Fuse computation of g and beta for Gated Delta Net
Labels: qwen, ready
#28095 opened Nov 5, 2025 by ZJY0516
5 tasks

[Docs] Add guide to debugging vLLM-torch.compile integration
Labels: documentation
#28094 opened Nov 5, 2025 by zou3519

[Docs] Clean up README_TUNING.md
#28088 opened Nov 5, 2025 by windsonsea

[PERF] Decouple projections from GDN custom op. Attempt 2
Labels: qwen, ready
#28083 opened Nov 5, 2025 by vadiklyutiy

Add runai model streamer e2e test for GCS
Labels: ci/build
#28079 opened Nov 4, 2025 by amacaskill
5 tasks done