
Conversation

@22quinn (Collaborator) commented on Aug 29, 2025

Purpose

Token-in-token-out is a common use case for RL, where the tokenizer is not needed. Without this PR, the API server rejects such requests with:

openai.BadRequestError: Error code: 400 - {'error': {'message': 'Unable to get tokenizer because skip_tokenizer_init is True', 'type': 'BadRequestError', 'param': None, 'code': 400}}
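
For context, a token-in-token-out request against the OpenAI-compatible server simply passes token IDs as the prompt. The sketch below is illustrative only: it assumes a vLLM server launched with --skip-tokenizer-init, and the base URL, model name, and token IDs are made up rather than taken from this PR's test.

```python
from openai import OpenAI

# Assumes a local vLLM OpenAI-compatible server started with --skip-tokenizer-init.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

prompt_token_ids = [9707, 11, 1879]  # example IDs, not produced by a real tokenizer run

completion = client.completions.create(
    model="Qwen/Qwen3-0.6B",
    prompt=prompt_token_ids,  # token IDs in; no server-side tokenizer required
    max_tokens=8,
)
print(completion.choices[0])
```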

Test Plan

pytest tests/entrypoints/openai/test_token_in_token_out.py

Test Result

Passed



Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request addresses a bug where using skip_tokenizer_init would cause a crash when making token-in-token-out requests. The changes correctly handle cases where the tokenizer is not initialized by making it optional in several functions. A new test is also added to cover this scenario. My review found a critical issue where a string prompt would still cause a crash if the tokenizer is not initialized. I've provided a code suggestion to add a check and raise a proper error in this case, improving the robustness of the implementation.
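
The suggested guard would look roughly like the following sketch; the function and variable names here are hypothetical, not vLLM internals or the exact suggestion from the bot.

```python
def ensure_prompt_is_tokenized(prompt, tokenizer) -> None:
    """Reject text prompts explicitly when no tokenizer is available."""
    # Hypothetical check: with skip_tokenizer_init=True there is no tokenizer,
    # so a plain-string prompt cannot be encoded and should fail with a clear error.
    if isinstance(prompt, str) and tokenizer is None:
        raise ValueError(
            "Cannot process a text prompt because skip_tokenizer_init is True; "
            "pass prompt token IDs instead.")
```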

Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
@22quinn added the labels ready (ONLY add when PR is ready to merge/full CI is needed) and rl (Related to RL workflows) on Aug 29, 2025
@DarkLight1337 (Member) left a comment

Thanks, LGTM

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) August 29, 2025 06:50
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
auto-merge was automatically disabled August 29, 2025 07:15

Head branch was pushed to by a user without write access

Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
MODEL_NAME,
allow_patterns=["*"],
cache_dir="/tmp/qwen3_06b",
ignore_patterns=["tokenizer*", "vocab*"])
Member commented:

also ignore the safetensors to accelerate test time.

@22quinn (Collaborator, Author) commented:

Ignoring the safetensors will cause a failure unless we set the load format to dummy:

RuntimeError: Cannot find any model weights with /tmp/qwen3_06b/models--Qwen--Qwen3-0.6B/snapshots/c1899de289a04d12100db370d81485cdf75e47ca
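
For illustration, combining the reviewer's idea with this caveat might look like the sketch below. The --load-format dummy flag exists in vLLM, but the RemoteOpenAIServer usage and whether the test actually adopts this approach are assumptions, not something shown in this PR.

```python
from huggingface_hub import snapshot_download

# Skip both the tokenizer files and the weights; the server then needs dummy weights.
model_path = snapshot_download(
    "Qwen/Qwen3-0.6B",
    allow_patterns=["*"],
    cache_dir="/tmp/qwen3_06b",
    ignore_patterns=["tokenizer*", "vocab*", "*.safetensors"],
)

# Hypothetical server launch: random ("dummy") weights, no tokenizer initialization.
server_args = ["--skip-tokenizer-init", "--load-format", "dummy"]
# e.g. with RemoteOpenAIServer(model_path, server_args) as server: ...
```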

from ...utils import RemoteOpenAIServer

MODEL_NAME = "Qwen/Qwen3-0.6B"
MODEL_PATH = "/tmp/qwen3_06b"
Member commented:

better use a tempdir without hardcoding a path.
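
A hypothetical way to follow this suggestion is to let pytest manage the download directory; the fixture name and scope below are illustrative, not the final test code.

```python
import pytest
from huggingface_hub import snapshot_download

MODEL_NAME = "Qwen/Qwen3-0.6B"

@pytest.fixture(scope="module")
def model_path(tmp_path_factory):
    # Download into a pytest-managed tempdir instead of a hardcoded /tmp path.
    return snapshot_download(
        MODEL_NAME,
        allow_patterns=["*"],
        cache_dir=tmp_path_factory.mktemp("qwen3_06b"),
        ignore_patterns=["tokenizer*", "vocab*"],
    )
```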

Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
@youkaichao (Member) left a comment

LGTM, thanks for the fix!

@youkaichao youkaichao enabled auto-merge (squash) August 29, 2025 07:59
@youkaichao (Member) commented:
failures are unrelated, merging.

@youkaichao youkaichao disabled auto-merge August 29, 2025 17:09
@youkaichao youkaichao merged commit 4d7fe40 into vllm-project:main Aug 29, 2025
37 of 40 checks passed
eicherseiji pushed a commit to eicherseiji/vllm that referenced this pull request Sep 9, 2025
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
@22quinn 22quinn deleted the no-tokenizer branch November 16, 2025 22:28

Labels

frontend, ready (ONLY add when PR is ready to merge/full CI is needed), rl (Related to RL workflows)
