Enable CPU nightly performance benchmark and its Markdown report #18444
Conversation
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.
Just a reminder: PRs do not trigger a full CI run by default; only a reduced subset of checks runs automatically. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. 🚀
@bigPYJ1151 could you help to review this PR?
@xuechendi please help to review and merge the PR. Thanks!
@louie-tsai, it looks like the rename of the GPU script is not necessary. Do you think it is OK to drop the changes to the existing GPU script and only add one for CPU?
@bigPYJ1151, please take a look at this PR. The customer-facing team wants to provide a CPU benchmark script from the vLLM upstream repo so customers can reproduce the numbers easily. Please check whether the current test settings make sense to you.
Addressed it accordingly.
bigPYJ1151 left a comment:
Frankly speaking, I'm not sure reusing the benchmark script between CPU and GPU is a good idea.
Meanwhile, this PR also touches the vLLM CI infrastructure and needs benchmark machines and pipelines to be set up. Do we have a plan to establish these?
I also think adding a -gpu suffix is not necessary, as it would introduce a lot of changes.
changed it accordingly.
Perhaps numa should be added to the requirements files, such as test.in.
Added them accordingly.
We are targeting the release Docker image; should we use the test image instead? If yes, do you have the build command for the test image?
Right, the test image includes more dependencies and packages for testing/benchmarking.
To build it, just add --target=vllm-test when building the CPU image.
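As a minimal sketch (the Dockerfile path and image tag below are assumptions; only --target=vllm-test comes from the comment above), that build could look like:

```bash
# Sketch only: build the vLLM CPU test image, which bundles the extra
# packages needed for benchmarking. The Dockerfile path and the
# vllm-cpu-test tag are assumptions; --target=vllm-test is the part
# confirmed above.
docker build -f docker/Dockerfile.cpu --target=vllm-test -t vllm-cpu-test .
```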
Why is this change required?
We used to face a parsing issue on CPU results, but there is no issue with the latest code, so I removed these changes.
Rebased to before #20200 to avoid the CI/CD issue introduced by it.
We need to standardize vLLM CPU benchmarks among customers and Intel users by using the vLLM benchmark suite.
We also hope to enable CPU performance numbers on the vLLM performance dashboard.
This PR enables the vLLM benchmark suite for CPU; below is a snapshot of the serving benchmark report.
The numbers are aligned with our Xeon EMR numbers.
How to run it on CPU from the vllm folder (see the container sketch below):
`ON_CPU=1 bash .buildkite/nightly-benchmarks/scripts/run-performance-benchmarks.sh`
Also added a new "Platform Information" section to the report to list the CPU info.
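A rough sketch of running this inside the CPU test image; the image tag, shared-memory size, and in-container script path are assumptions, and only the ON_CPU=1 command above comes from this PR:

```bash
# Sketch only: run the nightly benchmark suite inside a CPU test image.
# The vllm-cpu-test tag, --shm-size value, and script path inside the
# container are assumptions; the ON_CPU=1 invocation itself is from this PR.
docker run --rm --shm-size=4g \
    -e ON_CPU=1 \
    vllm-cpu-test \
    bash .buildkite/nightly-benchmarks/scripts/run-performance-benchmarks.sh
```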
Here is the full report: benchmark_results_0527_3.md
Overall, it took about 2 hours to run the current tests.
It also needs the automatic OMP thread binding from #17930.
Also added a compare-json-results.py script to compare different benchmark-results.json files (a usage sketch follows).
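A hypothetical invocation of that comparison script; the script path and positional-argument style are assumptions, so check the script's --help for the actual interface:

```bash
# Hypothetical usage sketch; the path and argument style are assumptions,
# not the script's documented interface.
python3 .buildkite/nightly-benchmarks/scripts/compare-json-results.py \
    run_a/benchmark-results.json \
    run_b/benchmark-results.json
```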
