@njhill njhill commented Aug 22, 2025

Fixes #23381.

The changes in #19912 incorrectly assumed that pooling requests last only a single step. With sequence chunking, however, a pooling request can span multiple steps.
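To illustrate why a pooling request can outlive a single step, here is a minimal sketch. It assumes "sequence chunking" means a long input is split across scheduler steps under a per-step token budget; `chunk_sizes` and its parameters are hypothetical names for illustration, not vLLM APIs.

```python
def chunk_sizes(prompt_len: int, max_tokens_per_step: int) -> list[int]:
    """Split a prompt into per-step chunks (illustrative only, not vLLM code)."""
    if prompt_len <= 0 or max_tokens_per_step <= 0:
        raise ValueError("prompt_len and max_tokens_per_step must be positive")
    sizes = []
    remaining = prompt_len
    while remaining > 0:
        # Each step processes at most max_tokens_per_step tokens.
        step = min(remaining, max_tokens_per_step)
        sizes.append(step)
        remaining -= step
    return sizes


# A short input finishes in one step; a long one stays in the batch for several.
short_request = chunk_sizes(512, 2048)
long_request = chunk_sizes(5000, 2048)
```

Under this assumption the long request occupies its batch slot for three steps, so any bookkeeping that treats pooling requests as single-step goes stale after the first one.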

We should have CI tests to cover this; they will be added in a follow-on.

Unfortunately this bug is in the v0.10.1 and v0.10.1.1 releases.

Summary of changes:

  • Use BatchUpdateBuilder in InputBatch to track removed requests, consistent with the non-pooling case; the freed slots are filled when requests are added in the same step
  • Add a batch_changed flag to BatchUpdateBuilder to track whether requests have been added (the added list is only used for non-pooling requests with sampling params)
  • Reorder operations in the input batch's condense() and swap_states() so that only the applicable ones are performed in the pooling case
  • Other minor code cleanup
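The first two items can be sketched roughly as follows. This is a simplified toy model, not the code in this PR: `ToyBatchUpdateBuilder`, `ToyInputBatch`, and all their methods are hypothetical stand-ins that only mirror the idea of tracking removed slots and a `batch_changed` flag.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ToyBatchUpdateBuilder:
    """Hypothetical stand-in; not vLLM's actual BatchUpdateBuilder."""
    removed: list = field(default_factory=list)  # freed slot indices
    batch_changed: bool = False

    def record_removed(self, index: int) -> None:
        self.removed.append(index)
        self.batch_changed = True

    def pop_removed(self) -> Optional[int]:
        """Return the lowest freed slot, or None if there is none."""
        if not self.removed:
            return None
        self.removed.sort(reverse=True)
        return self.removed.pop()


class ToyInputBatch:
    """Hypothetical batch holding one request id per slot."""

    def __init__(self) -> None:
        self.req_ids: list = []
        self.builder = ToyBatchUpdateBuilder()

    def add_request(self, req_id: str) -> int:
        # Reuse a slot freed earlier in this step before growing the batch.
        slot = self.builder.pop_removed()
        if slot is None:
            slot = len(self.req_ids)
            self.req_ids.append(req_id)
        else:
            self.req_ids[slot] = req_id
        # Set the flag for every added request, pooling or not.
        self.builder.batch_changed = True
        return slot

    def remove_request(self, req_id: str) -> None:
        slot = self.req_ids.index(req_id)
        self.req_ids[slot] = None
        self.builder.record_removed(slot)
```

In this toy model, removing a request from slot 1 and adding a new one in the same step reuses slot 1 instead of appending, and `batch_changed` is set for every added request, whether or not it carries sampling params.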

Signed-off-by: Nick Hill <[email protected]>
@njhill njhill marked this pull request as ready for review August 22, 2025 05:04
@njhill njhill added the ready label Aug 22, 2025
@WoosukKwon

@maxdebayser Could you please review?

@maxdebayser maxdebayser left a comment

Reading the code, it seems to me that this should fix the bug. I've already opened an issue to add CI tests for this case as a follow-up: #23436

@DarkLight1337 DarkLight1337 merged commit c80c53a into vllm-project:main Aug 23, 2025
46 checks passed
@njhill njhill deleted the fix-pooling-batch branch August 23, 2025 00:34
epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025
xiao-llm pushed a commit to xiao-llm/vllm that referenced this pull request Aug 28, 2025
zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025
Labels

bug (Something isn't working), ready (ONLY add when PR is ready to merge/full CI is needed), v1

Development

Successfully merging this pull request may close these issues.

[Bug]: Pooling models fail with V1 logits processors
