Skip to content

Commit 3f91c20

Browse files
bobbolidominicshanshan
authored andcommitted
[https://nvbugs/5467548][fix] DeepSeek illegal memory access. (NVIDIA#7298)
Signed-off-by: Bo Li <[email protected]> Signed-off-by: Wangshanshan <[email protected]>
1 parent 8a52015 commit 3f91c20

File tree

1 file changed

+2
-0
lines changed
  • tensorrt_llm/_torch/attention_backend

1 file changed

+2
-0
lines changed

tensorrt_llm/_torch/attention_backend/trtllm.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -843,8 +843,10 @@ def prepare_flash_mla(self) -> None:
843843
block_ids_per_seq = self.kv_cache_manager.get_block_ids_per_seq(
844844
self.request_ids).pin_memory()
845845
num_blocks = block_ids_per_seq.shape[1]
846+
self.kv_block_ids_per_seq.fill_(0)
846847
self.kv_block_ids_per_seq[:self.num_seqs, :num_blocks].copy_(
847848
block_ids_per_seq, non_blocking=True)
849+
self.block_ids_per_seq.fill_(0)
848850
self.block_ids_per_seq[:self.num_generations, :num_blocks].copy_(
849851
block_ids_per_seq[self.num_contexts:], non_blocking=True)
850852

0 commit comments

Comments
 (0)