- 
                Notifications
    You must be signed in to change notification settings 
- Fork 663
build: enable kvbm in trtllm container #2956
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: Anant Sharma <[email protected]>
Signed-off-by: Anant Sharma <[email protected]>
285a7b0    to
    716d177      
    Compare
  
    Signed-off-by: Anant Sharma <[email protected]>
Signed-off-by: Anant Sharma <[email protected]>
Signed-off-by: Anant Sharma <[email protected]>
| WalkthroughThe build script now enables KVBM when FRAMEWORK is either "VLLM" or "TRTLLM" in both relevant condition blocks. Tests adjusted: one vLLM-specific marker removed to broaden scope; one flaky test is skipped with a reason tag. Changes
 Sequence Diagram(s)sequenceDiagram
  autonumber
  participant Dev as Invoker
  participant Build as container/build.sh
  participant Env as ENV Vars
  participant Img as Docker Build
  Dev->>Build: Run build with FRAMEWORK
  Build->>Env: Read FRAMEWORK
  alt FRAMEWORK in {"VLLM","TRTLLM"}
    Build->>Env: Set ENABLE_KVBM=true
    note over Build,Env: KVBM forced on for VLLM/TRTLLM
  else Other frameworks
    Build->>Env: Leave ENABLE_KVBM as configured
  end
  Build->>Img: Proceed with image build
  Img-->>Dev: Image artifact
Estimated code review effort🎯 2 (Simple) | ⏱️ ~10 minutes Possibly related PRs
 Poem
 Pre-merge checks❌ Failed checks (1 warning)
 ✅ Passed checks (2 passed)
 Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment  | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 0
🧹 Nitpick comments (2)
tests/kvbm/test_determinism.py (1)
918-927: Prefer xfail over skip to keep signalMark flaky as xfail so it still runs and reports.
- @pytest.mark.skip(reason="Flaky test: DIS-665") + @pytest.mark.xfail(reason="Flaky test: DIS-665", strict=False) def test_concurrent_determinism_with_ifeval(container/build.sh (1)
696-699: Fix misleading log; Dockerfile.trtllm consumes ENABLE_KVBM
- Update the echo to reference the actual $FRAMEWORK being built.
- Dockerfile.trtllm declares
ARG ENABLE_KVBM=false, so TRTLLM will receive the build arg.-if [[ $FRAMEWORK == "VLLM" ]] || [[ $FRAMEWORK == "TRTLLM" ]]; then - echo "Forcing enable_kvbm to true in vLLM image build" +if [[ $FRAMEWORK == "VLLM" ]] || [[ $FRAMEWORK == "TRTLLM" ]]; then + echo "Forcing ENABLE_KVBM=true for ${FRAMEWORK} image build" ENABLE_KVBM=true fi
Signed-off-by: Anant Sharma <[email protected]>
Signed-off-by: Anant Sharma <[email protected]> Signed-off-by: Jason Zhou <[email protected]>
Signed-off-by: Anant Sharma <[email protected]> Signed-off-by: Jason Zhou <[email protected]>
Signed-off-by: Anant Sharma <[email protected]> Signed-off-by: Kyle H <[email protected]>
closes: OPS-850
Summary by CodeRabbit
Improvements
Tests