Commit f18cbdf

wip
1 parent d71767b commit f18cbdf

1 file changed, +5 -1 lines changed

vllm/v1/core/sched/scheduler.py

Lines changed: 5 additions & 1 deletion
@@ -476,7 +476,11 @@ def schedule(self) -> SchedulerOutput:
                 # Apply dynamic token budget constraints
                 effective_budget = self.get_dynamic_token_budget(request, token_budget)
                 num_new_tokens = min(num_new_tokens, effective_budget)
-                assert num_new_tokens > 0
+                # assert num_new_tokens > 0
+                if num_new_tokens == 0:
+                    self.waiting.pop_request()
+                    skipped_waiting_requests.prepend_request(request)
+                    continue

                 # Schedule encoder inputs.
                 if request.has_encoder_inputs:
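The change above replaces a hard assertion with a skip-and-requeue step: a request whose effective budget is zero is popped from the waiting queue, set aside, and the loop moves on. The following is a minimal standalone sketch of that pattern, not vLLM's scheduler. The function `schedule_waiting`, the `per_request_cap` callback (a stand-in for `get_dynamic_token_budget`), and the tuple-based request representation are all hypothetical, and returning the skipped requests to the front of the queue at the end is an assumption about what happens later in the real method.

```python
from collections import deque


def schedule_waiting(waiting, token_budget, per_request_cap):
    """Toy model of the skip-and-requeue pattern; not vLLM's real scheduler.

    waiting: deque of (request_id, num_tokens_needed) tuples.
    per_request_cap: hypothetical stand-in for get_dynamic_token_budget.
    """
    scheduled = []     # (request_id, tokens granted in this step)
    skipped = deque()  # requests set aside for a later step

    while waiting and token_budget > 0:
        request_id, needed = waiting[0]  # peek at the head of the queue
        effective_budget = per_request_cap(request_id, token_budget)
        num_new_tokens = min(needed, effective_budget)

        if num_new_tokens == 0:
            # Nothing can be granted to this request right now: set it
            # aside instead of asserting, then try the next request.
            waiting.popleft()
            skipped.appendleft((request_id, needed))
            continue

        waiting.popleft()
        scheduled.append((request_id, num_new_tokens))
        token_budget -= num_new_tokens

    # Assumption: skipped requests go back to the front of the queue.
    # appendleft followed by extendleft restores their original order.
    waiting.extendleft(skipped)
    return scheduled
```

With a cap that returns zero for one request, that request stays in `waiting` while the requests behind it are still scheduled, which is the behavior the old `assert` would have prevented by aborting.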
