
Conversation

terrytangyuan (Contributor):

The original source of the example can be found here: https://kserve.github.io/website/latest/modelserving/v1beta1/llm/vllm/
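For reference, that guide deploys vLLM as a custom predictor behind a KServe InferenceService resource. The sketch below shows the general shape of such a manifest; the container image, model name, and resource values are illustrative placeholders, not the exact YAML from the guide or from this PR.

```shell
# Sketch only: serve vLLM behind a KServe InferenceService (custom predictor).
# <YOUR-VLLM-IMAGE> and the model/resource values are placeholders; see the
# linked KServe documentation for the maintained example manifest.
kubectl apply -f - <<'EOF'
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: vllm-llama-2-7b
spec:
  predictor:
    containers:
      - name: kserve-container
        image: <YOUR-VLLM-IMAGE>   # placeholder: an image that starts a vLLM server
        args:
          - --model=meta-llama/Llama-2-7b-chat-hf
        resources:
          requests:
            nvidia.com/gpu: "1"
          limits:
            nvidia.com/gpu: "1"
EOF
```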

terrytangyuan (Contributor, Author):

cc @WoosukKwon @zhuohan123 @Yard1 @simon-mo Could you review this when you get a chance? Thanks!

terrytangyuan (Contributor, Author):

Friendly ping!

ywang96 (Member) commented Mar 1, 2024:

Thanks for adding this guide! @terrytangyuan

FYI - this tutorial is based on KServe + the vLLM /generate API server, which is kept only for demo purposes; the OpenAI-compatible API server is what will be supported going forward for production use.

AFAIK KServe doesn't support the OpenAI schema yet (it's WIP), so how about waiting for that work to land so we can have an official guide for deploying the vLLM OpenAI API server with KServe?
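To illustrate the distinction between the two servers mentioned above: the demo-only /generate endpoint takes a plain prompt plus sampling parameters, while the OpenAI-compatible server follows the OpenAI completions schema. A rough sketch, assuming a vLLM server running locally on its default port and a placeholder model name and prompt:

```shell
# Demo-only server (python -m vllm.entrypoints.api_server): plain /generate endpoint.
curl http://localhost:8000/generate \
  -H "Content-Type: application/json" \
  -d '{"prompt": "San Francisco is a", "max_tokens": 16, "temperature": 0}'

# OpenAI-compatible server (python -m vllm.entrypoints.openai.api_server):
# requests follow the OpenAI /v1/completions schema.
curl http://localhost:8000/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "meta-llama/Llama-2-7b-chat-hf", "prompt": "San Francisco is a", "max_tokens": 16}'
```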

terrytangyuan (Contributor, Author) commented Mar 1, 2024:

Thanks for taking a look, and for your thoughtful response! vLLM is already supported in KServe via kserve/kserve#3415.

While OpenAI schema compatibility is in progress, I think this documentation is useful for correcting the misunderstanding that KServe does not work with vLLM (many community users have asked about this). Perhaps I can remove the specific example YAML but keep the link in this doc, so that it points to the KServe docs, which we will keep updating going forward. WDYT?

ywang96 (Member) commented Mar 1, 2024:

> Thanks for taking a look, and for your thoughtful response! vLLM is already supported in KServe via kserve/kserve#3415.
>
> While OpenAI schema compatibility is in progress, I think this documentation is useful for correcting the misunderstanding that KServe does not work with vLLM (many community users have asked about this). Perhaps I can remove the specific example YAML but keep the link in this doc, so that it points to the KServe docs, which we will keep updating going forward. WDYT?

Good idea! I think this is something we should add to the doc since KServe is popular for serving orchestration with k8s. cc @simon-mo

terrytangyuan (Contributor, Author):

I just updated the PR. PTAL. Thank you!

ywang96 (Member) left a review comment:


LGTM - will need an approval from vLLM folks but I will let them know.

simon-mo merged commit 49d849b into vllm-project:main on Mar 1, 2024.
terrytangyuan deleted the kserve branch on March 1, 2024 at 19:07.
xjpang pushed a commit to xjpang/vllm that referenced this pull request on Mar 4, 2024.