
Conversation

@levunet (Contributor) commented Sep 12, 2025

Purpose

Fixed a bug where the commentary channel was missing from the valid channels in the system message because the with_custom_tools value was absent when fetching the system message.


@gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request addresses a bug that prevented tool usage with gpt-oss models. The issue was caused by the commentary channel being incorrectly removed from the system message, which is necessary for tool call functionality. The fix involves passing a with_custom_tools flag to the get_system_message function, determined by whether tools are present in the request. This change correctly preserves the commentary channel when tools are used. The fix is straightforward and effectively resolves the bug.
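The change described above can be sketched as follows. This is a minimal illustration, not vLLM's actual code: the names get_system_message and with_custom_tools come from the PR description, but the channel-list construction here is a simplified assumption.

```python
# Minimal sketch of the fix. `get_system_message` and `with_custom_tools`
# mirror the PR description; the channel-list construction is a
# simplified assumption, not vLLM's real implementation.

def get_system_message(with_custom_tools: bool = False) -> str:
    """Build the Harmony system message, keeping the `commentary`
    channel only when custom tools are in play."""
    channels = ["analysis", "final"]
    if with_custom_tools:
        channels.insert(1, "commentary")
    return "Valid channels: " + ", ".join(channels)

# Before the fix, the flag was never passed, so tool calls had no
# commentary channel to target:
assert "commentary" not in get_system_message()
# With the fix, requests that carry tools preserve the channel:
assert "commentary" in get_system_message(with_custom_tools=True)
```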

Contributor


Looks good to me. It would be great to add a unit test for this; there are some examples here. The test likely doesn't even need the model to run, just a check that the messages are being generated properly here.

Contributor


A small nit: this will activate the commentary channel even if an empty list of tools is passed in, which is not uncommon. We could avoid that by using something like bool(request.tools) instead of request.tools is not None. That said, I separately discovered this issue, Alec pointed me to this PR, and I agree that this fix is needed to get accurate tool calling in Chat Completions with gpt-oss models.

Without this PR, I regularly see the models outputting non-built-in tool calls to the analysis channel, which isn't where they should go.
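The nit above comes down to Python truthiness; a quick illustration (the request/tool shapes are hypothetical):

```python
# An empty tools list is not None, so `request.tools is not None`
# would still activate the commentary channel; bool() would not.
tools_empty: list = []
tools_present = [{"type": "function", "name": "get_weather"}]  # hypothetical tool

assert (tools_empty is not None) is True   # empty list still counts as "has tools"
assert bool(tools_empty) is False          # bool() treats it as "no tools"
assert bool(tools_present) is True
assert bool(None) is False
```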

@chaunceyjiang (Collaborator) left a comment


LGTM

@github-project-automation github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements Sep 16, 2025
@chaunceyjiang chaunceyjiang added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 16, 2025
@levunet levunet force-pushed the feat/fix_tool_check branch from 9f989c4 to 38382b4 Compare September 16, 2025 13:36
Fixed a bug where the commentary value was missing in Invalid Channel due to the absence of with_custom_tools value when fetching the system message.

Signed-off-by: kyt <[email protected]>
@bbrowning (Contributor)

This PR substantially improves Chat Completions streaming tool call handling for Harmony models, especially for gpt-oss-20b. Without this PR, the 20b model often (and 120b sometimes) outputs tool calls to the analysis channel, which is wrong. We could adjust our streaming parser code to handle that, which is a fairly trivial change. However, the more appropriate fix is this PR, which activates the commentary channel when the request contains tools.

It may not reduce 100% of cases where tool calls inappropriately go to the analysis channel, but it makes a massive improvement.

@levunet (Contributor, Author) commented Sep 26, 2025

@bbrowning
There are additional PRs that have modified the content you mentioned. I believe the harmony library improvements have resolved the issue close to 100%. If you need it, I recommend using that harmony code.

#24954
openai/harmony#76

@aarnphm (Collaborator) left a comment


Hopefully this fixes it. Thanks for this!

@bbrowning (Contributor)

Just a note that the CI failures look unrelated, as both are failing in a disk space check on the docker build jobs. This PR doesn't change anything that would impact free disk space on the builder nodes.

@lorenzocollodi

Any updates on this? Or has it been fixed somewhere else?

@borishim

I can confirm that by applying openai/harmony#76, #24768 and #24954 , codex works very well with gpt-oss-120b.

@PedroMiolaSilva

Hey guys, any news here? Looking forward to this merge!

@DarkLight1337 DarkLight1337 merged commit 2ed3f20 into vllm-project:main Oct 3, 2025
43 checks passed
@DarkLight1337 (Member)

Sorry for the delay, merging now

rahul-tuli pushed a commit to neuralmagic/vllm that referenced this pull request Oct 3, 2025
yewentao256 pushed a commit that referenced this pull request Oct 3, 2025
tomeras91 pushed a commit to tomeras91/vllm that referenced this pull request Oct 6, 2025
karan pushed a commit to karan/vllm that referenced this pull request Oct 6, 2025
southfreebird pushed a commit to southfreebird/vllm that referenced this pull request Oct 7, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
@bfroemel

I can confirm that by applying openai/harmony#76, #24768 and #24954 , codex works very well with gpt-oss-120b.

@borishim could you share how you made that work besides applying the PRs? Is this with "--tool-call-parser openai --enable-auto-tool-choice" and responses API? Many thanks!

@borishim commented Oct 10, 2025

I can confirm that by applying openai/harmony#76, #24768 and #24954 , codex works very well with gpt-oss-120b.

@borishim could you share how you made that work besides applying the PRs? Is this with "--tool-call-parser openai --enable-auto-tool-choice" and responses API? Many thanks!

Here are the switches I used:
--tool-call-parser openai --reasoning-parser openai-gptoss --enable-auto-tool-choice --max-model-len 131072

Note that I applied the patches against the 0.10.2 docker images. Also, the current codex CLI expects the Chat Completions API for gpt-oss, so I think you should use the Chat Completions API.
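The flags above can be assembled into a full launch command. This is a sketch: the model path is illustrative, and it assumes the patched 0.10.2 image mentioned above.

```shell
# Assumed invocation on a patched 0.10.2 image; the model path is illustrative.
vllm serve openai/gpt-oss-120b \
  --tool-call-parser openai \
  --reasoning-parser openai-gptoss \
  --enable-auto-tool-choice \
  --max-model-len 131072
```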

@levunet (Contributor, Author) commented Oct 10, 2025

@bfroemel @borishim

I found a bug in vLLM 0.11.0 where, after installing FlashInfer (which is then used by default for sampling), specific sentences would repeat infinitely or the model would fail to generate responses.

If you hit this, setting the following environment variable restores normal behavior:

VLLM_USE_FLASHINFER_SAMPLER=0

Also, I think this might be a minor mistake, but you should probably use 'openai_gptoss' for the reasoning-parser.
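Putting the two notes above together, a sketched launch command; the environment variable must be set in the server's environment, and the model path is illustrative.

```shell
# Disable the FlashInfer sampler and use the underscore parser name,
# per the notes above; the model path is illustrative.
VLLM_USE_FLASHINFER_SAMPLER=0 vllm serve openai/gpt-oss-120b \
  --tool-call-parser openai \
  --reasoning-parser openai_gptoss \
  --enable-auto-tool-choice
```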

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
npanpaliya pushed a commit to odh-on-pz/vllm-cpu that referenced this pull request Oct 27, 2025
Sync to upstream's
[v0.11.0](https://github.com/vllm-project/vllm/releases/tag/v0.11.0)
release + a cherry pick of
vllm-project/vllm#24768

This PR targets CUDA but may also be sufficient for ROCM.

Dockerfile updates:
- general updates to match upstream's Dockerfile
- nvcc, nvrtc and cuobjdump were added for deepgemm JIT requirements:
neuralmagic/nm-vllm-ent@2a545c8
- missing paths were added for triton JIT:
neuralmagic/nm-vllm-ent@b3027fc

Tests:
Branch in nm-cicd:
https://github.com/neuralmagic/nm-cicd/tree/sync-v0.11-cuda
accept-sync:
https://github.com/neuralmagic/nm-cicd/actions/runs/18270550524 --
please ignore unit tests, they need to be updated to v1.
Image tested: quay.io/vllm/automation-vllm:cuda-18270550524
Image validation:
https://github.com/neuralmagic/nm-cicd/actions/runs/18271507914
Whisper runs:
https://github.com/neuralmagic/nm-cicd/actions/runs/18281815955/job/52046560584
https://github.com/neuralmagic/nm-cicd/actions/runs/18281511979

Labels

frontend gpt-oss Related to GPT-OSS models ready ONLY add when PR is ready to merge/full CI is needed

Projects

Status: Done


10 participants