[Bugfix] Add support for <tool_call> format in streaming mode for XLAM Tool Parser
#22769
Conversation
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can add 🚀
Code Review
This pull request correctly adds support for the <tool_call> format in streaming mode by leveraging the existing preprocess_model_output method. The changes to the parser logic are sound. The accompanying tests have been updated, but the new streaming test is confusing and could be improved for clarity and correctness. I've provided a suggestion to refactor the test to be more straightforward and representative of a streaming scenario.
Force-pushed from b394d10 to 2c12b4d
@aarnphm Is there anything else you need from me? Or do I just sit tight?

Can u check the CI failure here?
…ming mode with the XLAM tool parser Signed-off-by: Devon Peroutky <devon@kindo.ai>
Head branch was pushed to by a user without write access
Force-pushed from 0e2a07f to d199a97
Signed-off-by: Devon Peroutky <devon@kindo.ai>
Force-pushed from d199a97 to ec50893
…ling Signed-off-by: Devon Peroutky <devon@kindo.ai>
Force-pushed from ec50893 to 26f11d8
All the tests are passing now! FYI, I modified the implementation slightly since you first approved it. However, I've added much more comprehensive tests for each possible XLAM format to account for the changes.
DarkLight1337 left a comment
Thanks and sorry for the delay
…LAM Tool Parser (vllm-project#22769) Signed-off-by: Devon Peroutky <devon@kindo.ai>
Issue
The XLAM Tool Parser documentation states that it supports the following formats:

However, the <tool_call>...</tool_call> tag format is broken/not supported in streaming mode; the only output format supported in streaming mode is direct JSON arrays. Additionally, none of the formats support any preamble before the tool call. Example:

This PR fixes that as well, for all formats.
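To make the two output shapes concrete, here is a minimal, hypothetical extraction helper (not vLLM's actual parser code) that tolerates both a <tool_call> wrapper and an optional natural-language preamble; the names and strategy here are illustrative assumptions only:

```python
# Illustrative sketch only: a hypothetical helper, NOT the real XLAM parser.
import json
import re


def extract_tool_calls(model_output: str) -> list:
    """Parse tool calls from either supported output shape:
    1. A <tool_call>...</tool_call> wrapped JSON array (optionally preceded
       by a natural-language preamble).
    2. A direct JSON array (optionally preceded by a preamble).
    """
    # Prefer content wrapped in <tool_call>...</tool_call> tags.
    match = re.search(r"<tool_call>(.*?)</tool_call>", model_output, re.DOTALL)
    if match:
        payload = match.group(1).strip()
    else:
        # Fall back to a direct JSON array, skipping any preamble text
        # before the first '['.
        start = model_output.find("[")
        payload = model_output[start:] if start != -1 else ""
    try:
        return json.loads(payload)
    except json.JSONDecodeError:
        return []


calls = extract_tool_calls(
    'Let me check the weather. <tool_call>'
    '[{"name": "get_weather", "arguments": {"city": "Paris"}}]'
    '</tool_call>'
)
```

The same helper also handles the direct-array form, e.g. `extract_tool_calls('Sure! [{"name": "f", "arguments": {}}]')`, which is the shape that already worked in streaming mode before this PR.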
Proposed Plan
This PR adds support for the
<tool_call>...</tool_call>tag output format in streaming mode for the XLAM Tool Parser, while maintaining the existing functionality for every other output format.Test Plan
I've added additional unit test cases, for both synchronous and streaming mode, to verify that all possible XLAM formats are parsed correctly. The existing tests should catch any regressions.
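A streaming-mode test can be sketched as follows. This is an assumed accumulate-and-parse harness, not vLLM's real test API: it feeds the model output in small deltas to mimic token-by-token streaming and checks that the tool call is recovered once the closing tag arrives.

```python
# Sketch of a streaming-style test under an assumed accumulate-and-parse
# strategy; the helper below is hypothetical, not the real vLLM parser.
import json
import re


def parse_accumulated(buffer: str):
    """Return parsed tool calls once a complete <tool_call> block is seen."""
    match = re.search(r"<tool_call>(.*?)</tool_call>", buffer, re.DOTALL)
    return json.loads(match.group(1)) if match else None


full_output = (
    'Sure, calling the tool now. <tool_call>'
    '[{"name": "sum", "arguments": {"a": 1, "b": 2}}]</tool_call>'
)
# Split into 5-character deltas to mimic incremental streaming chunks.
deltas = [full_output[i:i + 5] for i in range(0, len(full_output), 5)]

buffer = ""
result = None
for delta in deltas:
    buffer += delta
    parsed = parse_accumulated(buffer)
    if parsed is not None:
        result = parsed  # the closing </tool_call> tag has arrived
```

Note that a real streaming parser must also emit partial deltas as they arrive rather than waiting for the closing tag; this sketch only checks the end-to-end result of incremental input.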
Test Results
Local
BuildKite
TBD