Skip to content

Commit 74a8beb

Browse files
committed
remove disable_overlap_scheduler: false
Signed-off-by: raayandhar <[email protected]>
1 parent 5102bcd commit 74a8beb

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

docs/source/blogs/tech_blog/blog6_Llama4_maverick_eagle_guide.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -68,7 +68,7 @@ docker run -d --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 \
6868
-p 8000:8000 --gpus=all -e "TRTLLM_ENABLE_PDL=1" \
6969
-v /path/to/maverick:/config/models/maverick -v /path/to/eagle:/config/models/eagle \
7070
docker.io/<username>/tensorrt_llm:main sh \
71-
-c "echo -e 'disable_overlap_scheduler: false\nenable_autotuner: false\nenable_attention_dp: false\nenable_min_latency: true\ncuda_graph_config:\n max_batch_size: 8\nspeculative_config:\n decoding_type: Eagle\n max_draft_len: 3\n speculative_model_dir: /config/models/eagle\n eagle3_one_model: true\nkv_cache_config:\n enable_block_reuse: false' > c.yaml && \
71+
-c "echo -e 'enable_autotuner: false\nenable_attention_dp: false\nenable_min_latency: true\ncuda_graph_config:\n max_batch_size: 8\nspeculative_config:\n decoding_type: Eagle\n max_draft_len: 3\n speculative_model_dir: /config/models/eagle\n eagle3_one_model: true\nkv_cache_config:\n enable_block_reuse: false' > c.yaml && \
7272
TRT_LLM_DISABLE_LOAD_WEIGHTS_IN_PARALLEL=True \
7373
trtllm-serve /config/models/maverick \
7474
--host 0.0.0.0 --port 8000 \

0 commit comments

Comments
 (0)