Skip to content

Commit 3d97e01

Browse files
committed
disable kvcache block reuse for disagg+pp tests
Signed-off-by: Lizhi Zhou <[email protected]>
1 parent b6eba85 commit 3d97e01

File tree

4 files changed

+7
-0
lines changed

4 files changed

+7
-0
lines changed

tests/integration/defs/disaggregated/test_configs/disagg_config_ctxpp2_genpp2.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ context_servers:
1414
kv_cache_config:
1515
free_gpu_memory_fraction: 0.2
1616
enable_partial_reuse: False
17+
enable_block_reuse: False
1718
disable_overlap_scheduler: True
1819
cache_transceiver_config:
1920
backend: DEFAULT
@@ -29,6 +30,7 @@ generation_servers:
2930
kv_cache_config:
3031
free_gpu_memory_fraction: 0.2
3132
enable_partial_reuse: False
33+
enable_block_reuse: False
3234
disable_overlap_scheduler: True
3335
cache_transceiver_config:
3436
backend: DEFAULT

tests/integration/defs/disaggregated/test_configs/disagg_config_ctxpp2_gentp2.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ context_servers:
1414
kv_cache_config:
1515
free_gpu_memory_fraction: 0.2
1616
enable_partial_reuse: False
17+
enable_block_reuse: False
1718
disable_overlap_scheduler: True
1819
cache_transceiver_config:
1920
backend: DEFAULT

tests/integration/defs/disaggregated/test_configs/disagg_config_ctxpp4_genpp4.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ context_servers:
1414
kv_cache_config:
1515
free_gpu_memory_fraction: 0.2
1616
enable_partial_reuse: False
17+
enable_block_reuse: False
1718
disable_overlap_scheduler: True
1819
cache_transceiver_config:
1920
backend: DEFAULT
@@ -29,6 +30,7 @@ generation_servers:
2930
kv_cache_config:
3031
free_gpu_memory_fraction: 0.2
3132
enable_partial_reuse: False
33+
enable_block_reuse: False
3234
disable_overlap_scheduler: True
3335
cache_transceiver_config:
3436
backend: DEFAULT

tests/integration/defs/disaggregated/test_configs/disagg_config_ctxtp2pp2_gentp2pp2.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ context_servers:
1414
kv_cache_config:
1515
free_gpu_memory_fraction: 0.2
1616
enable_partial_reuse: False
17+
enable_block_reuse: False
1718
disable_overlap_scheduler: True
1819
cache_transceiver_config:
1920
backend: DEFAULT
@@ -29,6 +30,7 @@ generation_servers:
2930
kv_cache_config:
3031
free_gpu_memory_fraction: 0.2
3132
enable_partial_reuse: False
33+
enable_block_reuse: False
3234
disable_overlap_scheduler: True
3335
cache_transceiver_config:
3436
backend: DEFAULT

0 commit comments

Comments
 (0)