Skip to content

Conversation

@louie-tsai
Copy link
Contributor

@louie-tsai louie-tsai commented May 21, 2025

Need to standardize vLLM CPU benchmarks among customers and intel users by using vLLM benchmark suite.
Also hope to enable CPU perf numbers on vLLM performance dashboard.

Enable vLLM benchmark suite for CPU and below are snapshot of serving benchmark report.
numbers are aligned with our Xeon EMR numbers.

How to run it on CPU under vllm folder:
ON_CPU=1 bash .buildkite/nightly-benchmarks/scripts/run-performance-benchmarks.sh

also added a new section "Platform Information" section to list out CPU info

image

Here is the full report.
benchmark_results_0527_3.md

overall, it took ~2 hours for current tests.

it also needs to have auto OMP thread binding from below PR.
#17930

Also added a compare-json-results.py to compare among different benchmark-results.json files
image

@github-actions
Copy link

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

@mergify mergify bot added the ci/build label May 21, 2025
@louie-tsai louie-tsai force-pushed the nightly_cpu_benchmark branch 9 times, most recently from 913611c to 298aba1 Compare June 3, 2025 00:21
@louie-tsai louie-tsai force-pushed the nightly_cpu_benchmark branch 3 times, most recently from 1299794 to 51ede3a Compare June 4, 2025 01:09
@louie-tsai
Copy link
Contributor Author

@bigPYJ1151 could you help to review this PR?

@louie-tsai louie-tsai force-pushed the nightly_cpu_benchmark branch 2 times, most recently from 2f11d5e to 21ce351 Compare June 4, 2025 21:22
@louie-tsai
Copy link
Contributor Author

@xuechendi please help to review and merge the PR. thanks

@louie-tsai louie-tsai force-pushed the nightly_cpu_benchmark branch 2 times, most recently from 5bf0667 to 433ed5c Compare June 9, 2025 17:53
@xuechendi
Copy link
Contributor

@louie-tsai , It looks like the rename for GPU script is not necessary, do you think it is OK to drop changes to existing GPU script and only add for CPU?

@xuechendi
Copy link
Contributor

@bigPYJ1151 , please take a look of this PR, the custom facing team want to provide CPU benchmark script from VLLM upstream repo, so customer can reproduce the number easily. Please check if current test settings makes sense to you.

@louie-tsai louie-tsai force-pushed the nightly_cpu_benchmark branch 4 times, most recently from 7eb926e to f5243fc Compare June 10, 2025 20:05
@louie-tsai
Copy link
Contributor Author

@louie-tsai , It looks like the rename for GPU script is not necessary, do you think it is OK to drop changes to existing GPU script and only add for CPU?

addressed it accordingly

Copy link
Member

@bigPYJ1151 bigPYJ1151 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Frankly speaking, I'm not sure reusing the benchmark script between CPU and GPU is a good idea.

Meanwhile, this PR also relates to vllm CI infra, needs to setup benchmark machines and pipelines. Do we have plan to establish these?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I also think adding -gpu suffix is not necessary as it will introduce lots of changes.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

changed it accordingly.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Perhaps numa should be added to requirements like test.in.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added them accordingly.
We are targeting on release docker image. should we use test image instead?
if yes, do you have build command for test image?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Right, the test image included more dictionaries and packages for testing/benchmarking.

To build it, just add --target=vllm-test when building the CPU image.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is this change required?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

used to face a parsing issue on CPU results, but no issue on the latest code. remove the changes.

@louie-tsai louie-tsai force-pushed the nightly_cpu_benchmark branch 2 times, most recently from a876145 to 2353aa2 Compare June 11, 2025 21:47
louie-tsai added 16 commits July 2, 2025 10:50
Signed-off-by: Tsai, Louie <[email protected]>
Signed-off-by: Tsai, Louie <[email protected]>
Signed-off-by: Tsai, Louie <[email protected]>
…-to-markdown.py

Co-authored-by: Eero Tamminen <[email protected]>

Signed-off-by: Tsai, Louie <[email protected]>
Signed-off-by: Tsai, Louie <[email protected]>
@louie-tsai louie-tsai reopened this Jul 2, 2025
@louie-tsai louie-tsai force-pushed the nightly_cpu_benchmark branch from 8b05811 to 104c444 Compare July 2, 2025 17:53
@louie-tsai
Copy link
Contributor Author

rebase before #20200 to avoid CI/CD issue from it

@simon-mo simon-mo merged commit 9965c47 into vllm-project:main Jul 3, 2025
13 checks passed
Pradyun92 pushed a commit to Pradyun92/vllm that referenced this pull request Aug 6, 2025
jinzhen-lin pushed a commit to jinzhen-lin/vllm that referenced this pull request Aug 9, 2025
@ghost ghost mentioned this pull request Oct 9, 2025
4 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build performance Performance-related issues

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants