Skip to content

skip vision parts of the model for test_train_vlm_multi_image as well#5774

Open
kaixuanliu wants to merge 1 commit into
huggingface:mainfrom
kaixuanliu:grpo_ci_fix
Open

skip vision parts of the model for test_train_vlm_multi_image as well#5774
kaixuanliu wants to merge 1 commit into
huggingface:mainfrom
kaixuanliu:grpo_ci_fix

Conversation

@kaixuanliu
Copy link
Copy Markdown
Contributor

@kaixuanliu kaixuanliu commented May 15, 2026

We need to skip vision part of the model in multi-image test case, just like what we did in test_train_vlm, pls help review, @qgallouedec @kashif


Note

Low Risk
Low risk: test-only change that relaxes parameter-update assertions for known non-updating vision components in tiny VLMs.

Overview
Aligns test_train_vlm_multi_image with test_train_vlm by skipping parameter-change assertions for vision-related modules (e.g., vision_tower, projector, visual, image_newline) where tiny model initialization can prevent gradient flow.

This reduces flaky/false-negative failures in the multi-image VLM training test while still asserting that the rest of the model updates during trainer.train().

Reviewed by Cursor Bugbot for commit 88856bd. Bugbot is set up for automated code reviews on this repo. Configure here.

Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
@qgallouedec
Copy link
Copy Markdown
Member

qgallouedec commented May 15, 2026

test_train_vlm_multi_image does pass, I don't understand why we should skip some params

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants