[Benchmark][Doc] Update throughput benchmark and README #15998

StevenShi-23 · 2025-04-03T07:29:48Z

This PR updates benchmark README for the latest change from #15955 . Also, it adds AIMODataset to benchmark_throughput.py

@JenZhao @ywang96 Thank you for the comments!

Signed-off-by: StevenShi-23 <[email protected]>

github-actions · 2025-04-03T07:29:58Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

JenZhao · 2025-04-03T17:54:37Z

benchmarks/benchmark_throughput.py

-
+        elif args.dataset_path in AIMODataset.SUPPORTED_DATASET_PATHS:
+            dataset_cls = AIMODataset
+            common_kwargs['dataset_subset'] = args.hf_subset


if dataset_subset is not needed by this dataset you can put it as None

Hi, thanks for the comments. I leave it because hf_subset by default is None. I'll explicit set it to None to avoid error.

JenZhao · 2025-04-03T17:55:05Z

benchmarks/README.md

+  --backend vllm \
+  --dataset-name hf \
+  --dataset-path AI-MO/aimo-validation-aime \
+  --hf-split train \


you can remove this line --hf-split train \

JenZhao · 2025-04-03T18:04:13Z

Thank you! As a precaution, could you run both online and offline with the same seed to see if the token counts match?

StevenShi-23 · 2025-04-04T06:17:28Z

Thank you! As a precaution, could you run both online and offline with the same seed to see if the token counts match?

I ran the offline and online benchmark with the same seed, and the input token count did not match (1204 vs 1229). But the token count is reproducible within offline or online benchmark itself using the same seed.

I think it may deserve a separate PR to investigate it.

Signed-off-by: StevenShi-23 <[email protected]>

ywang96 · 2025-04-04T16:10:34Z

benchmarks/benchmark_throughput.py

+        elif args.dataset_path in AIMODataset.SUPPORTED_DATASET_PATHS:
+            assert args.backend == "vllm", "AIMODataset needs to use vllm as the backend."  #noqa: E501


Hmm this if elif is growing quite long so let me push a change to make it cleaner

Signed-off-by: Roger Wang <[email protected]>

…#15998) Signed-off-by: StevenShi-23 <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]> Signed-off-by: xinyuxiao <[email protected]>

…#15998) Signed-off-by: StevenShi-23 <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]> Signed-off-by: Louis Ulmer <[email protected]>

…#15998) Signed-off-by: StevenShi-23 <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]>

…#15998) Signed-off-by: StevenShi-23 <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: Roger Wang <[email protected]> Signed-off-by: Mu Huai <[email protected]>

StevenShi-23 added 2 commits April 3, 2025 15:26

[Benchmark] Update readme and throughput benchmark

e3871ec

Signed-off-by: StevenShi-23 <[email protected]>

add aimo dataset split

a800045

Signed-off-by: StevenShi-23 <[email protected]>

JenZhao reviewed Apr 3, 2025

View reviewed changes

[Benchmark] Explicit set hf dataset subset for AIMO

e0dc790

Signed-off-by: StevenShi-23 <[email protected]>

ywang96 reviewed Apr 4, 2025

View reviewed changes

small refactor

143ab6a

Signed-off-by: Roger Wang <[email protected]>

ywang96 approved these changes Apr 4, 2025

View reviewed changes

ywang96 enabled auto-merge (squash) April 4, 2025 16:27

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 4, 2025

vllm-bot merged commit 95862f7 into vllm-project:main Apr 4, 2025
13 of 19 checks passed

ckhordiasma mentioned this pull request Apr 17, 2025

[do not merge] pr test for nm changes into 2.20 red-hat-data-services/vllm#107

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Benchmark][Doc] Update throughput benchmark and README #15998

[Benchmark][Doc] Update throughput benchmark and README #15998

Uh oh!

StevenShi-23 commented Apr 3, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Apr 3, 2025

Uh oh!

JenZhao Apr 3, 2025

Uh oh!

StevenShi-23 Apr 4, 2025

Uh oh!

JenZhao Apr 3, 2025

Uh oh!

StevenShi-23 Apr 4, 2025

Uh oh!

JenZhao commented Apr 3, 2025

Uh oh!

StevenShi-23 commented Apr 4, 2025

Uh oh!

ywang96 Apr 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		elif args.dataset_path in AIMODataset.SUPPORTED_DATASET_PATHS:
		assert args.backend == "vllm", "AIMODataset needs to use vllm as the backend." #noqa: E501

Uh oh!

[Benchmark][Doc] Update throughput benchmark and README #15998

[Benchmark][Doc] Update throughput benchmark and README #15998

Uh oh!

Conversation

StevenShi-23 commented Apr 3, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Apr 3, 2025

Uh oh!

JenZhao Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

StevenShi-23 Apr 4, 2025

Choose a reason for hiding this comment

Uh oh!

JenZhao Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

StevenShi-23 Apr 4, 2025

Choose a reason for hiding this comment

Uh oh!

JenZhao commented Apr 3, 2025

Uh oh!

StevenShi-23 commented Apr 4, 2025

Uh oh!

ywang96 Apr 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

StevenShi-23 commented Apr 3, 2025 •

edited by github-actions bot

Loading