Commit 196332c

zhangyuygss and 开哲 authored and committed
[Bugfix][Core] add seq_id_to_seq_group clearing to avoid memory leak when s… (vllm-project#16472)
Signed-off-by: 开哲 <[email protected]>
Co-authored-by: 开哲 <[email protected]>
1 parent a64a844 commit 196332c

1 file changed: +7 -0 lines changed

vllm/outputs.py

Lines changed: 7 additions & 0 deletions
@@ -201,6 +201,13 @@ def from_seq_group(
                 group.finish_seq(seq_group)
             if assembled_seq_group is None:
                 return None
+
+            # clear finished seq in seq_id_to_seq_group
+            if len(group.to_be_finished) == 0:
+                for sub_request_id in list(group.seq_id_to_index.keys()):
+                    if sub_request_id in seq_id_to_seq_group:
+                        del seq_id_to_seq_group[sub_request_id]
+
             return cls.from_seq_group(assembled_seq_group, use_cache,
                                       seq_id_to_seq_group)
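
The effect of the added block can be sketched in isolation. The snippet below is a simplified illustration, not the vLLM implementation: `ToyGroup`, `release_group_if_done`, and the sub-request ids are hypothetical stand-ins, and only the attribute names `to_be_finished`, `seq_id_to_index`, and `seq_id_to_seq_group` are taken from the diff. It assumes every sub-request of a parent request maps to the same group object, so entries should only be dropped once no sub-request is still pending.

```python
class ToyGroup:
    """Hypothetical stand-in for the group object touched by the diff."""

    def __init__(self, sub_request_ids):
        # sub-request id -> position within the parent request's outputs
        self.seq_id_to_index = {sid: i for i, sid in enumerate(sub_request_ids)}
        # sub-requests that have not finished yet (assumed to be a set here)
        self.to_be_finished = set(sub_request_ids)

    def finish_seq(self, sub_request_id):
        # Simplified: the real method receives a sequence group, not an id.
        self.to_be_finished.discard(sub_request_id)


def release_group_if_done(group, seq_id_to_seq_group):
    """Mirror of the added lines: drop all entries once nothing is pending."""
    if len(group.to_be_finished) == 0:
        # Copy the keys first so the loop is unaffected by later mutation.
        for sub_request_id in list(group.seq_id_to_index.keys()):
            if sub_request_id in seq_id_to_seq_group:
                del seq_id_to_seq_group[sub_request_id]


sub_ids = ["0_req", "1_req"]  # hypothetical sub-request ids
group = ToyGroup(sub_ids)
seq_id_to_seq_group = {sid: group for sid in sub_ids}

group.finish_seq("0_req")
release_group_if_done(group, seq_id_to_seq_group)
remaining_after_first = len(seq_id_to_seq_group)

group.finish_seq("1_req")
release_group_if_done(group, seq_id_to_seq_group)
remaining_after_second = len(seq_id_to_seq_group)
```

In this sketch the mapping keeps both entries while one sub-request is still pending and becomes empty after the last one finishes. Without the cleanup step, the entries would stay in the mapping for the lifetime of the process, which is the kind of growth the commit title describes as a memory leak.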
