From 08d49ae8ce01600601caaa06cf5359ffd962907c Mon Sep 17 00:00:00 2001
From: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
Date: Tue, 24 Jun 2025 02:01:16 +0000
Subject: [PATCH 1/2] fix

Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
---
 examples/ep_load_balancer/README.md                | 3 +++
 .../_torch/modules/fused_moe/fused_moe_wide_ep.py  | 8 ++++----
 2 files changed, 7 insertions(+), 4 deletions(-)

diff --git a/examples/ep_load_balancer/README.md b/examples/ep_load_balancer/README.md
index eaa277c4a18..6a693bf3ba8 100644
--- a/examples/ep_load_balancer/README.md
+++ b/examples/ep_load_balancer/README.md
@@ -27,6 +27,7 @@ Run 32-way expert parallelism inference on the prepared dataset. Please refer to
 cat > ./extra_llm_api_options.yaml < ./extra_llm_api_options_eplb.yaml < ./extra_llm_api_options_eplb.yaml <
Date: Tue, 24 Jun 2025 18:27:06 +0800
Subject: [PATCH 2/2] unwaive

Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
---
 tests/integration/test_lists/waives.txt | 1 -
 1 file changed, 1 deletion(-)

diff --git a/tests/integration/test_lists/waives.txt b/tests/integration/test_lists/waives.txt
index 7d33974d394..6b69817e3b5 100644
--- a/tests/integration/test_lists/waives.txt
+++ b/tests/integration/test_lists/waives.txt
@@ -441,4 +441,3 @@ accuracy/test_disaggregated_serving.py::TestDeepSeekV3Lite::test_auto_dtype[mtp_
 accuracy/test_llm_api_pytorch.py::TestDeepSeekV3Lite::test_fp8_block_scales[mtp=vanilla-fp8kv=False-attention_dp=False-cuda_graph=False-overlap_scheduler=False-torch_compile=False] SKIP (https://nvbugs/5354946)
 examples/test_multimodal.py::test_llm_multimodal_general[kosmos-2-pp:1-tp:1-float16-bs:1-cpp_e2e:True-nb:1] SKIP (https://nvbugs/5354936)
 examples/test_multimodal.py::test_llm_multimodal_general[fuyu-8b-pp:1-tp:1-float16-bs:1-cpp_e2e:True-nb:1] SKIP (https://nvbugs/5354936)
-accuracy/test_llm_api_pytorch.py::TestDeepSeekV3Lite::test_fp8_block_scales_4gpus_static_eplb SKIP (https://nvbugs/5354925)