[Multimodal] Always enable hashing mm data #23308

ywang96 · 2025-08-21T04:11:48Z

Purpose

Hashing multimodal (MM) data is required for prefix caching and feature caching to work with multimodal models, and it has been enabled by default since vLLM V1 was released. The community has been generally happy with both features, and there is no real reason for a user to turn both of them off.

This PR modifies the internals so that multimodal data is always hashed and mm_hashes is always present whenever mm_data is. This also facilitates #22711.

A follow-up PR will allow users to pass in mm_hashes themselves from the frontend, as suggested in #22044.
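To illustrate the invariant described above (every item in mm_data gets a matching entry in mm_hashes), here is a minimal sketch. It is not vLLM's implementation: the function names `hash_item` and `compute_mm_hashes` are hypothetical, items are assumed to be raw bytes already, and SHA-256 from the standard library stands in for whatever hasher vLLM actually uses.

```python
from __future__ import annotations

import hashlib
from typing import Mapping, Sequence


def hash_item(modality: str, payload: bytes) -> str:
    """Return a content hash for one multimodal item (hypothetical helper)."""
    hasher = hashlib.sha256()
    # Mix in the modality so identical bytes under different modalities differ.
    hasher.update(modality.encode("utf-8"))
    # Length-prefix the payload to keep the modality/payload boundary unambiguous.
    hasher.update(len(payload).to_bytes(8, "little"))
    hasher.update(payload)
    return hasher.hexdigest()


def compute_mm_hashes(
    mm_data: Mapping[str, Sequence[bytes]],
) -> dict[str, list[str]]:
    """Hash every item unconditionally; the output mirrors the input's shape."""
    return {
        modality: [hash_item(modality, item) for item in items]
        for modality, items in mm_data.items()
    }


if __name__ == "__main__":
    mm_data = {"image": [b"first image bytes", b"second image bytes"]}
    mm_hashes = compute_mm_hashes(mm_data)
    # One hash per item, keyed by the same modality names as mm_data.
    print({modality: len(hashes) for modality, hashes in mm_hashes.items()})
```

Because the hash depends only on content, two requests that carry the same image map to the same key, which is what lets a prefix cache or feature cache reuse earlier work.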

Test Plan

Test Result

(Optional) Documentation Update


Signed-off-by: Roger Wang <hey@rogerw.io>

github-actions · 2025-08-21T04:11:56Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

vllm/inputs/preprocess.py

vllm/v1/engine/processor.py

Signed-off-by: Roger Wang <hey@rogerw.io>

ywang96 · 2025-08-21T07:00:27Z

It seems we need to explicitly move embedding inputs to the CPU before converting them to NumPy (https://buildkite.com/vllm/ci/builds/27839#0198caf0-8e8d-479c-8bb8-91308921a501):

vllm/vllm/multimodal/hasher.py

Lines 45 to 46 in 3128240

    if isinstance(obj, torch.Tensor):
        return cls.item_to_bytes("tensor", obj.numpy())

Yep - done in 08acd67

vllm/model_executor/models/prithvi_geospatial_mae.py

Signed-off-by: Roger Wang <hey@rogerw.io>

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 · 2025-08-21T14:23:18Z

Failing tests are due to connection error -- force-merging

Signed-off-by: Roger Wang <hey@rogerw.io> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>

update

279908b

Signed-off-by: Roger Wang <hey@rogerw.io>

mergify bot added the deepseek, llama, multi-modality, and v1 labels Aug 21, 2025

ywang96 mentioned this pull request Aug 21, 2025

[Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests #22711

Merged

4 tasks

Merge branch 'main' into mm-hash

ee6b123

ywang96 marked this pull request as ready for review August 21, 2025 04:20

ywang96 requested review from DarkLight1337, WoosukKwon, alexm-redhat, comaniac, njhill, patrickvonplaten and robertgshaw2-redhat as code owners August 21, 2025 04:20

ywang96 requested review from Isotr0py and removed request for WoosukKwon, alexm-redhat, comaniac, njhill, patrickvonplaten and robertgshaw2-redhat August 21, 2025 04:22

DarkLight1337 reviewed Aug 21, 2025

vllm/inputs/preprocess.py (outdated)

vllm/v1/engine/processor.py

vllm/v1/engine/processor.py

comments

44526f5

Signed-off-by: Roger Wang <hey@rogerw.io>

ywang96 requested review from houseroad, mgoin, simon-mo, tlrmchlsmth and youkaichao as code owners August 21, 2025 04:27

DarkLight1337 reviewed Aug 21, 2025

vllm/model_executor/models/prithvi_geospatial_mae.py

ywang96 and others added 4 commits August 21, 2025 08:58

update

79e5641

Signed-off-by: Roger Wang <hey@rogerw.io>

update

33c4b51

Signed-off-by: Roger Wang <hey@rogerw.io>

cleanup

b0a1f95

Signed-off-by: Roger Wang <hey@rogerw.io>

Fix Prithvi

71ab9d5

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

vllm-bot merged commit 79f05e4 into vllm-project:main Aug 21, 2025
39 of 41 checks passed

ywang96 mentioned this pull request Aug 22, 2025

[Core][Multimodal] Allow passing multi_modal_uuids as multimodal identifiers. #23394

Merged

4 tasks

ywang96 mentioned this pull request Aug 26, 2025

[Multimodal] Generate mm_hash based on request metadata when caching is turned off #23690

Merged

5 tasks
