[Bugfix] Multi-modal caches not acting like LRU caches by DarkLight1337 · Pull Request #16593 · vllm-project/vllm

DarkLight1337 · 2025-04-14T13:22:34Z

LRUCache is not updated by __contains__. So, we should call get directly.

Also, fix __getitem__ not updating cache stats

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

github-actions · 2025-04-14T13:22:46Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

…16593) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Yang Wang <elainewy@meta.com>

…16593) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

…16593) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>

[Bugfix] Multi-modal caches not acting like LRU caches

a906993

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 14, 2025

DarkLight1337 requested a review from Isotr0py April 14, 2025 13:22

DarkLight1337 requested review from WoosukKwon, alexm-redhat, comaniac, njhill, robertgshaw2-redhat and ywang96 as code owners April 14, 2025 13:22

DarkLight1337 mentioned this pull request Apr 14, 2025

[Metrics] Log multi-modal cache stats #16478

Closed

mergify bot added the v1 label Apr 14, 2025

DarkLight1337 mentioned this pull request Apr 14, 2025

[Bugfix] Avoid transferring cached multi-modal items from P0 to P1 #16273

Merged

Isotr0py approved these changes Apr 14, 2025

View reviewed changes

DarkLight1337 added 2 commits April 14, 2025 14:53

Update cache and tests

8420d58

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Try fix pre-commit

8108c36

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 mentioned this pull request Apr 14, 2025

[BUG] MM LRU cache fixing #16599

Closed

Fix

931d094

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

vllm-bot merged commit aa29841 into vllm-project:main Apr 14, 2025
39 of 47 checks passed

DarkLight1337 deleted the fix-mm-cache branch April 14, 2025 16:24

DarkLight1337 mentioned this pull request Apr 19, 2025

[Bug]: KeyError in mm_input_cache when processing multimodal requests with Qwen2.5-VL-72B #16875

Closed

1 task

yangw-dev pushed a commit to yangw-dev/vllm that referenced this pull request Apr 21, 2025

[Bugfix] Multi-modal caches not acting like LRU caches (vllm-project#…

9faf3e0

…16593) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Yang Wang <elainewy@meta.com>

jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Apr 29, 2025

[Bugfix] Multi-modal caches not acting like LRU caches (vllm-project#…

31d9540

…16593) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025

[Bugfix] Multi-modal caches not acting like LRU caches (vllm-project#…

5cf9b53

…16593) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 mentioned this pull request May 8, 2025

[Bug]: mm_cache keyerror #16903

Closed

1 task

ckhordiasma mentioned this pull request May 14, 2025

nm vllm ent 0.8.5 sync red-hat-data-services/vllm#139

Merged

DarkLight1337 mentioned this pull request Jun 28, 2025

[Bug]: Llama4 multimodal cache missing key #20203

Closed

1 task

DarkLight1337 mentioned this pull request Jul 8, 2025

[Bug]: VLLM unexpectedly exited after serve due to cache hashkey #17227

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Bugfix] Multi-modal caches not acting like LRU caches#16593

[Bugfix] Multi-modal caches not acting like LRU caches#16593
vllm-bot merged 4 commits intovllm-project:mainfrom
DarkLight1337:fix-mm-cache

DarkLight1337 commented Apr 14, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Apr 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

DarkLight1337 commented Apr 14, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Apr 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

DarkLight1337 commented Apr 14, 2025 •

edited by github-actions bot

Loading