[Models] Use in-place adds in Idefics2Vision #23932
Conversation
Code Review
This pull request introduces a performance optimization in Idefics2VisionTransformer by replacing out-of-place additions with in-place additions. The changes in Idefics2VisionEmbeddings and Idefics2EncoderLayer are correct and safe, as they operate on locally-scoped tensors, avoiding side effects. This modification should lead to reduced memory allocation and improved throughput as described in the pull request, which is a valuable improvement for inference performance. The changes are well-implemented. As you suggested in the description, applying this optimization to other models in the codebase would likely be beneficial as well.
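For context on why the review calls out locally-scoped tensors, here is a small illustrative sketch (the function names are hypothetical, not from the PR): an in-place add is only safe when the tensor being mutated was created inside the current scope, so no tensor the caller still holds gets changed underneath it.

```python
import torch

def risky_inplace(x: torch.Tensor) -> torch.Tensor:
    # Mutates the caller's tensor: if `x` is still referenced elsewhere
    # (e.g. cached, reused as a residual, or needed by autograd), the
    # in-place add silently changes that value too.
    x += 1.0
    return x

def safe_inplace(x: torch.Tensor) -> torch.Tensor:
    # `y` is allocated inside this function, so mutating it in place
    # cannot affect any tensor the caller still holds a reference to.
    y = x * 2.0
    y += 1.0
    return y
```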
Thanks for the optimization. Let's open a separate PR with a benchmark for each model.
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
Head branch was pushed to by a user without write access
Force-pushed from 4c086b6 to c727b73
Rebased to re-trigger CI...
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
Purpose
This changes the `Idefics2VisionTransformer` model to use in-place adds where possible. This is relevant since vision encoders currently run outside of torch.compile, so additions like this won't be automatically optimised away. See also #18922.

Related to: #23884
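For illustration, a minimal sketch of the kind of change this makes, assuming a simplified encoder-layer forward pass (the actual vLLM `Idefics2EncoderLayer` code differs, and the module arguments here are hypothetical): the out-of-place residual adds are replaced with in-place adds on tensors that are only referenced locally.

```python
import torch
import torch.nn as nn

def encoder_layer_forward(hidden_states: torch.Tensor,
                          self_attn: nn.Module,
                          layer_norm1: nn.Module,
                          layer_norm2: nn.Module,
                          mlp: nn.Module) -> torch.Tensor:
    # Attention block: previously `hidden_states = residual + hidden_states`,
    # which allocates a fresh tensor for every add.
    residual = hidden_states
    hidden_states = layer_norm1(hidden_states)
    hidden_states = self_attn(hidden_states)
    hidden_states += residual  # in-place: reuses the attention output buffer

    # MLP block: same pattern.
    residual = hidden_states
    hidden_states = layer_norm2(hidden_states)
    hidden_states = mlp(hidden_states)
    hidden_states += residual  # in-place add on a locally created tensor
    return hidden_states
```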
Test Plan
Test Result
In an internal benchmark with many image inputs this has a surprisingly big performance impact. I'm seeing a 5.5%–6.2% increase in throughput with the `openbmb/MiniCPM-V-4_5` model running on an L40S GPU.

@DarkLight1337 let me know if you'd like me to do a bit of search and replace and apply similar changes in the other model definitions as well, which might also benefit text-only models when run in eager mode.