
Conversation

Member

@DarkLight1337 DarkLight1337 commented Nov 17, 2025

Purpose

Fix an issue caused by #28665: the CLI still uses the static defaults from SchedulerConfig, so the dynamic defaults are never applied. Sorry for breaking this!

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@DarkLight1337 DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 17, 2025
@DarkLight1337 DarkLight1337 enabled auto-merge (squash) November 17, 2025 17:44
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request correctly fixes a bug where CLI arguments for dynamic SchedulerConfig fields (max_num_batched_tokens, max_num_seqs, enable_chunked_prefill) were using static defaults from SchedulerConfig, preventing the intended dynamic default logic from executing. By overriding the default to None for these CLI arguments, the engine can now correctly apply dynamic defaults based on the usage context and hardware. The changes are clear and address the issue effectively. I've also pointed out a similar issue with async_scheduling that could be addressed for consistency.
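
For readers less familiar with the pattern being applied here, the sketch below shows the general None-as-sentinel idea in isolation: the CLI default is None so that "flag not given" can be told apart from "flag set to the static default", and a later resolution step picks the context-dependent value. The argument name is borrowed from the PR, but the resolver and the 8192/2048 numbers are invented for illustration and are not vLLM's actual logic or defaults.

    import argparse

    # Minimal illustration of the None-as-sentinel pattern; the resolver and its
    # numbers are made up for this sketch and are not vLLM's real defaults.
    parser = argparse.ArgumentParser()
    # default=None distinguishes "user did not pass the flag" from an explicit value.
    parser.add_argument("--max-num-batched-tokens", type=int, default=None)

    def resolve_max_num_batched_tokens(value: int | None, usage_context: str) -> int:
        if value is not None:
            return value  # explicit user choice always wins
        # Hypothetical dynamic default keyed by usage context / hardware.
        return 8192 if usage_context == "serve" else 2048

    args = parser.parse_args([])
    print(resolve_max_num_batched_tokens(args.max_num_batched_tokens, "serve"))  # 8192
    args = parser.parse_args(["--max-num-batched-tokens", "4096"])
    print(resolve_max_num_batched_tokens(args.max_num_batched_tokens, "serve"))  # 4096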

Comment on lines +1049 to +1060

          "--max-num-batched-tokens",
          **{
              **scheduler_kwargs["max_num_batched_tokens"],
              "default": None,
          },
      )
      scheduler_group.add_argument(
-         "--max-num-seqs", **scheduler_kwargs["max_num_seqs"]
+         "--max-num-seqs",
+         **{
+             **scheduler_kwargs["max_num_seqs"],
+             "default": None,
+         },

high

While you're fixing the CLI defaults for dynamic SchedulerConfig fields, it seems like async_scheduling might be another case that needs a similar change.

Currently, EngineArgs.async_scheduling defaults to False (from SchedulerConfig.async_scheduling), and the CLI argument also defaults to False. This means the logic in VllmConfig.__post_init__ to dynamically enable async_scheduling (where self.scheduler_config.async_scheduling is None) will never be triggered.

To enable the dynamic default behavior for async_scheduling, you might need to:

  1. Change the default value of async_scheduling in EngineArgs to None.
  2. Update its add_argument call in add_cli_args to set default=None, similar to the other fields in this PR.

This would make its behavior consistent with the other dynamically configured scheduler arguments.
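
A minimal sketch of the tri-state flag the bot is describing: None means "decide automatically", while True/False record an explicit user choice. The field name mirrors the discussion, but the dataclass and the resolution rule below are illustrative only and do not reproduce the actual EngineArgs/VllmConfig code.

    from dataclasses import dataclass

    # Illustrative only; the resolution rule is invented for this example and is
    # not vLLM's actual async_scheduling logic.
    @dataclass
    class EngineArgsSketch:
        # None = "not specified"; True/False = explicit user choice.
        async_scheduling: bool | None = None

    def resolve_async_scheduling(args: EngineArgsSketch, platform_supports_it: bool) -> bool:
        if args.async_scheduling is not None:
            return args.async_scheduling  # respect the explicit choice
        # Hypothetical dynamic default: enable only when the platform supports it.
        return platform_supports_it

    print(resolve_async_scheduling(EngineArgsSketch(), platform_supports_it=True))   # True
    print(resolve_async_scheduling(EngineArgsSketch(async_scheduling=False), True))  # False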

@mgoin mgoin added the bug Something isn't working label Nov 17, 2025
Member

ywang96 commented Nov 17, 2025

Looking at the failing test - it seems that max_num_batched_tokens is still using the same value as max_model_len?

[2025-11-17T18:40:57Z] INFO 11-17 10:40:57 [model.py:1745] Using max model len 448
...
[2025-11-17T18:41:25Z] (EngineCore_DP0 pid=5910) ValueError: Chunked MM input disabled but max_tokens_per_mm_item (1500) is larger than max_num_batched_tokens (448). Please increase max_num_batched_tokens.

Signed-off-by: DarkLight1337 <[email protected]>
Member Author

DarkLight1337 commented Nov 18, 2025

It is caused by this code, which seems intentional. cc @NickLucche @russellb

            # When using default settings,
            # Ensure max_num_batched_tokens does not exceed model limit.
            # Some models (e.g., Whisper) have embeddings tied to max length.
            self.max_num_batched_tokens = min(
                self.max_num_seqs * model_config.max_model_len,
                self.max_num_batched_tokens,
            )

For now I have worked around it by increasing max_num_seqs.
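
To make the arithmetic behind the workaround concrete, the snippet below plugs the numbers from the failing-test log (max_model_len 448, a 1500-token multimodal item) into the min() above. The incoming max_num_batched_tokens value and the max_num_seqs choices are assumptions for illustration; they are not necessarily the values the test uses.

    # 448 and 1500 come from the log above; 8192 and the max_num_seqs values are
    # assumed for illustration only.
    max_model_len = 448
    max_tokens_per_mm_item = 1500
    incoming_max_num_batched_tokens = 8192

    for max_num_seqs in (1, 4):
        capped = min(max_num_seqs * max_model_len, incoming_max_num_batched_tokens)
        print(max_num_seqs, capped, capped >= max_tokens_per_mm_item)
    # max_num_seqs=1 -> 448  (too small for a 1500-token MM item, hence the ValueError)
    # max_num_seqs=4 -> 1792 (clears the 1500-token requirement)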

@DarkLight1337
Member Author

The previously failing test now passes; merging, since the other entrypoint test is also failing on main.

@vllm-bot vllm-bot merged commit bf9e1e8 into vllm-project:main Nov 18, 2025
39 of 47 checks passed
@DarkLight1337 DarkLight1337 deleted the fix-serve-defaults branch November 18, 2025 04:30
@DarkLight1337 DarkLight1337 added this to the v0.11.1 milestone Nov 18, 2025
Victor49152 pushed a commit to Victor49152/vllm that referenced this pull request Nov 20, 2025
