Pull requests: vllm-project/vllm

Pull requests list

add function id tool-calling
#42921 opened May 18, 2026 by Alex-ai-future Draft
4 tasks
[CPU] Support cpu compressed-tensor w8a8 int8 moe cpu Related to CPU backends
#42920 opened May 18, 2026 by yuwenzho Contributor Draft
4 tasks
Add DeepSeek-V4 XPU support with FP8 KV cache deepseek Related to DeepSeek models intel-gpu Related to Intel GPU rocm Related to AMD ROCm v1
#42919 opened May 18, 2026 by majian4work Contributor
add svg documentation Improvements or additions to documentation
#42918 opened May 18, 2026 by gracie-guo
4 tasks
docs: clarify CUDA nightly wheel index priority documentation Improvements or additions to documentation nvidia
#42917 opened May 18, 2026 by AmanPandey28
4 tasks
[ROCm][CI] Stabilize ROCm pooling and multimodal CI multi-modality Related to multi-modality (#4194) qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#42909 opened May 18, 2026 by AndreasKaratzas Collaborator
[ROCm][Perf] Enabled FP4Indexer for DSV4 rocm Related to AMD ROCm v1
#42908 opened May 18, 2026 by tjtanaa Collaborator Draft
4 tasks
[MRV2][CI] Add update_config method for V2 Runner ready ONLY add when PR is ready to merge/full CI is needed v1
#42907 opened May 18, 2026 by jikunshang Collaborator
4 tasks
[Bugfix][kv_offload] Dedup gpu_block_ids in eager-mode SCO (store + load paths) bug Something isn't working v1
#42903 opened May 18, 2026 by alexbi29
4 tasks done
RISC-V ILP Optimization: Add instruction-level parallelism for transcendental functions cpu Related to CPU backends documentation Improvements or additions to documentation performance Performance-related issues
#42900 opened May 17, 2026 by mohankku Contributor
add cutedsl dsv4 indexer fp8 kernel v1
#42899 opened May 17, 2026 by gnovack Contributor
Support Nomic-embed-text-v1 with transformers v5
#42894 opened May 17, 2026 by ieBoytsov Contributor
2 of 4 tasks
[ROCm][DSv4] Functional fixes for DeepSeek V4 on MI300X (gfx942) deepseek Related to DeepSeek models rocm Related to AMD ROCm v1
#42893 opened May 17, 2026 by maeehart Contributor
[KV Events] Switch event structs from array to map encoding documentation Improvements or additions to documentation
#42892 opened May 17, 2026 by sagearc Contributor
Remove Pydantic v2.11 workaround: simplify Mistral tokenizer tool call handling cpu Related to CPU backends documentation Improvements or additions to documentation frontend mistral Related to Mistral models performance Performance-related issues
#42891 opened May 17, 2026 by mohankku Contributor
4 tasks done
Support nvfp4 kv with kv-cache-dtype-skip-layers sliding_window v1
#42890 opened May 17, 2026 by sychen52 Contributor
4 tasks
[Refactor] Remove dead code ready ONLY add when PR is ready to merge/full CI is needed
#42889 opened May 17, 2026 by yewentao256 Member
[Model Runner v2] fix pd accuracy bug Something isn't working kv-connector ready ONLY add when PR is ready to merge/full CI is needed
#42888 opened May 17, 2026 by ZJY0516 Member
4 tasks