[Chore] Cleanup guided namespace, move to structured outputs config #22772

aarnphm · 2025-08-13T00:44:57Z

Continuation of #17420

This PR introduces the --structured-outputs-config argument to unify all structured-outputs configuration in one CLI field.
This should simplify the UX for specifying custom backend options.

I also removed all previous guided_decoding options.

This is a breaking change: the --guided-decoding-* options no longer exist. Instead, use --structured-outputs-config '{...}' or --structured-outputs-config.backend outlines.
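
As a rough illustration of the two invocation styles mentioned above, the sketch below builds the argument list for each form. Only the flag name `--structured-outputs-config`, the `backend` key, and the value `outlines` come from this PR description. The helper names `as_json_flag` and `as_dotted_flags` are hypothetical, and nothing here calls vLLM itself.

```python
import json
import shlex


def as_json_flag(config: dict) -> list[str]:
    """Return argv for the single-JSON form: --structured-outputs-config '{...}'."""
    return ["--structured-outputs-config", json.dumps(config, sort_keys=True)]


def as_dotted_flags(config: dict) -> list[str]:
    """Return argv for the dotted form: --structured-outputs-config.<key> <value>.

    Assumption: only flat, string-valued options are handled in this sketch.
    """
    argv: list[str] = []
    for key, value in sorted(config.items()):
        argv += [f"--structured-outputs-config.{key}", str(value)]
    return argv


if __name__ == "__main__":
    cfg = {"backend": "outlines"}
    # shlex.join quotes the JSON payload so it survives a POSIX shell.
    print(shlex.join(as_json_flag(cfg)))
    print(shlex.join(as_dotted_flags(cfg)))
```

Both forms carry the same information; the JSON form keeps every option in one argument, while the dotted form is easier to type when only a single option such as the backend is being set.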

Signed-off-by: Aaron Pham [email protected]
Signed-off-by: Harry Mellor [email protected]
Co-authored-by: Nick Hill [email protected]
Co-authored-by: Harry Mellor [email protected]

github-actions · 2025-08-13T00:45:18Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

mergify · 2025-08-13T00:45:35Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @aarnphm.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: Aaron Pham <[email protected]>

…llm-project#2907)

### What this PR does / why we need it?

1. This PR bumps the vLLM commit to vllm-project/vllm@6d8246a
2. Fix upstream changes from vllm-project/vllm#24548 (abort multi-modal kwargs); make vLLM main and `v0.10.2` both adaptable
3. Fix metadata_builder changes introduced by vllm-project/vllm#23693
4. Fix `structured_outputs_config` changes introduced by vllm-project/vllm#22772
5. Fix `moe_config` changes introduced by vllm-project/vllm#22537

Co-authored-by: MengqingCao <[email protected]>
Co-authored-by: Yikun Jiang <[email protected]>

- vLLM version: v0.10.2
- vLLM main: vllm-project/vllm@c60e613

---------

Signed-off-by: wangli <[email protected]>
Signed-off-by: MengqingCao <[email protected]>
Co-authored-by: MengqingCao <[email protected]>

They have been removed in vllm-project/vllm#25117 and vllm-project/vllm#22772, thus failing in trunk at the moment after the latest pin commit update.

Pull Request resolved: pytorch#163383
Approved by: https://github.com/wdvr, https://github.com/seemethere, https://github.com/malfet

simon-mo · 2025-09-22T20:34:07Z

@aarnphm can we add backward compatibility for one version so people know how to migrate?

hmellor · 2025-09-22T22:14:06Z

BC for GuidedDecodingParams added in #25422

BC for CLI was already maintained in this PR

The only other area we may want to consider is BC in the server API (not sure how we want to handle that).

russellb · 2025-09-23T15:44:36Z

I think backwards compatibility in the server API is critical.

https://docs.vllm.ai/en/latest/contributing/deprecation_policy.html#overview

hmellor · 2025-09-23T16:09:16Z

I can make a similar PR ensuring BC for the server API. However I'm not sure how we can warn a user of the deprecated API that it's deprecated?

russellb · 2025-09-23T17:10:45Z

> I can make a similar PR ensuring BC for the server API. However I'm not sure how we can warn a user of the deprecated API that it's deprecated?

Only in docs for the first step. The next step for the HTTP API would be to turn it off by default, but make an option for turning it back on. Then finally, remove it completely.
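
The three steps described above (documentation-only deprecation, then off by default with an opt-in, then removal) could be modeled roughly as follows. This is a minimal sketch, not vLLM code: the `Stage` enum, the `legacy_fields_allowed` function, and the `opt_in` switch are all hypothetical names chosen for illustration.

```python
from enum import Enum


class Stage(Enum):
    """Hypothetical phases of retiring a legacy HTTP API field."""

    DOCS_ONLY = 1       # legacy fields still accepted; deprecation noted in docs only
    OFF_BY_DEFAULT = 2  # legacy fields rejected unless the operator opts back in
    REMOVED = 3         # legacy fields always rejected


def legacy_fields_allowed(stage: Stage, opt_in: bool = False) -> bool:
    """Return True if a request using legacy fields should still be served."""
    if stage is Stage.DOCS_ONLY:
        return True
    if stage is Stage.OFF_BY_DEFAULT:
        return opt_in
    return False
```

The point of the middle stage is that an HTTP client cannot see a server-side deprecation warning, so the operator-controlled switch is the first moment the change becomes visible to users while still being reversible.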

This was a mistake introduced by vllm-project#22772. Structured output requests were not actually working because the format spec was not placed in the proper new location in the request body. Signed-off-by: Russell Bryant <[email protected]>

Culprit commit: vllm-project/vllm#22772 --------- Signed-off-by: Agata Dobrzyniewicz <[email protected]> Signed-off-by: slokesha <[email protected]>

hmellor · 2025-09-24T22:10:42Z

I've just created #25615 to add BC for the server API too
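
For readers wondering what server-side backward compatibility might look like in principle, here is a minimal sketch of a request-body shim. It is not the implementation from #25615. The thread only names the `guided_...` prefix; the nested field name `structured_outputs`, the example key `guided_regex`, and the function `migrate_request_body` are assumptions made for illustration.

```python
LEGACY_PREFIX = "guided_"         # prefix named in the thread ("guided_... API")
NEW_FIELD = "structured_outputs"  # assumed name of the new nested field


def migrate_request_body(body: dict) -> dict:
    """Return a copy of body with legacy guided_* keys nested under NEW_FIELD."""
    # Keep every key that is not a legacy one.
    migrated = {k: v for k, v in body.items() if not k.startswith(LEGACY_PREFIX)}
    # Collect legacy keys with the prefix stripped, e.g. guided_regex -> regex.
    legacy = {
        k[len(LEGACY_PREFIX):]: v
        for k, v in body.items()
        if k.startswith(LEGACY_PREFIX)
    }
    if legacy:
        # Explicitly supplied new-style values win over legacy ones.
        nested = dict(legacy)
        nested.update(migrated.get(NEW_FIELD) or {})
        migrated[NEW_FIELD] = nested
    return migrated
```

For example, under these assumptions a body of `{"model": "m", "guided_regex": "a+"}` would be rewritten so that the regex sits inside the nested field, while a body that already uses the new field is returned unchanged.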

…llm-project#22772) Signed-off-by: Aaron Pham <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]>

…llm-project#22772) Signed-off-by: Aaron Pham <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Signed-off-by: charlifu <[email protected]>

They have been removed in vllm-project/vllm#25117 and vllm-project/vllm#22772, thus failing in trunk at the moment after the latest pin commit update Pull Request resolved: #163383 Approved by: https://github.com/wdvr, https://github.com/seemethere, https://github.com/malfet (cherry picked from commit a31acf3)

Clean up obsoleted vLLM tests (#163383)

They have been removed in vllm-project/vllm#25117 and vllm-project/vllm#22772, thus failing in trunk at the moment after the latest pin commit update.

Pull Request resolved: #163383
Approved by: https://github.com/wdvr, https://github.com/seemethere, https://github.com/malfet

(cherry picked from commit a31acf3)

Co-authored-by: Huy Do <[email protected]>

…llm-project#22772) Signed-off-by: Aaron Pham <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

aarnphm requested review from DarkLight1337, WoosukKwon, alexm-redhat, comaniac, hmellor, mgoin, njhill, patrickvonplaten, robertgshaw2-redhat, russellb, simon-mo, youkaichao, ywang96 and zhuohan123 as code owners August 13, 2025 00:44

mergify bot added documentation Improvements or additions to documentation frontend performance Performance-related issues structured-output v1 labels Aug 13, 2025

mergify bot added the tool-calling label Aug 13, 2025

github-project-automation bot added this to Structured Output Aug 13, 2025

mergify bot added the needs-rebase label Aug 13, 2025

github-project-automation bot added this to Tool Calling Aug 13, 2025

chore: finalize cleanup from v0

69068cd

Signed-off-by: Aaron Pham <[email protected]>

aarnphm force-pushed the feat/decoding-args-rename-all branch from d3ac885 to 69068cd Compare August 13, 2025 00:47

aarnphm requested review from houseroad, tlrmchlsmth and yewentao256 as code owners August 13, 2025 00:47

Yikun mentioned this pull request Sep 22, 2025

[Bug]: Fix vllm main issue (0922) vllm-project/vllm-ascend#3083

Open

jiqing-feng mentioned this pull request Sep 22, 2025

update guided decoding param to structured outputs huggingface/trl#4117

Open

qgallouedec mentioned this pull request Sep 22, 2025

📌 Pin vLLM version huggingface/trl#4122

Merged

russellb mentioned this pull request Sep 23, 2025

[Benchmark] Fix regression in structured output benchmark #25500

Merged

This was referenced Sep 24, 2025

Add backward compatibility for GuidedDecodingParams #25422

Merged

Add backward compatibility for guided_... API #25615

Merged

pytorchbot mentioned this pull request Sep 30, 2025

Clean up obsoleted vLLM tests pytorch/pytorch#164282

Merged
