[Frontend] speed up import time of vllm.config #18036
Conversation
Force-pushed from c1b18c2 to 6921702
This pull request has merge conflicts that must be resolved before it can be merged.
Force-pushed from bb4c8d5 to 054f562
@aarnphm this is ready for review, thanks! cc @simon-mo @Chen-0210
I'm a bit hesitant to optimize this file lazily, given that it touches a lot of components within vLLM.
Also, let's try to keep type-hint changes to a minimum.
This PR will require running the whole suite to make sure it doesn't introduce any regressions.
Force-pushed from 1ec7328 to 6a65c3d
Head branch was pushed to by a user without write access
by changing submodules to lazily import expensive modules like `vllm.model_executor.layers.quantization`, or by only importing them for type checkers when they aren't used at runtime. Contributes to vllm-project#14924. Signed-off-by: David Xia <[email protected]>
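The type-checker half of that commit message is the standard `typing.TYPE_CHECKING` gate: the heavy import is only evaluated by static analysis tools, never at `import` time. A minimal sketch (the `expensive_pkg` module and `HeavyClass` name are hypothetical stand-ins, not from this PR):

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Only evaluated by static type checkers (mypy, pyright),
    # never at runtime, so it adds nothing to import time.
    from expensive_pkg import HeavyClass  # hypothetical heavy module

def describe(obj: "HeavyClass") -> str:
    # The annotation is a string, so HeavyClass need not be
    # importable when this function is defined or called.
    return type(obj).__name__
```

Because the annotation is quoted, the function stays callable even in environments where the heavy dependency isn't installed.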
@aarnphm thanks for reviewing again. I rebased away the conflict and fixed the pre-commit Python formatting check. All checks pass now, and it's ready for another review. 🙏
On my M1 Mac (64 GB memory, Python 3.12, editable install of vLLM following these docs):

before: with master commit 3443aaf
after: with master commit 7108934

~2.15% speed-up in the average import time ((4.332 − 4.239) ÷ 4.332)
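A quick sanity check of the quoted percentage from the before/after means:

```python
def relative_speedup(before: float, after: float) -> float:
    """Fractional reduction in mean import time."""
    return (before - after) / before

# Before/after mean import times (seconds) from the benchmark above.
pct = relative_speedup(4.332, 4.239) * 100
print(f"{pct:.2f}%")  # 2.15%
```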
by changing some modules in `vllm/multimodal` to lazily import expensive modules like `transformers`, or by only importing them for type checkers when they aren't used at runtime. Contributes to #14924.
I ran `python -X importtime -c 'import vllm' 2> import.log && tuna import.log` on the main branch. The visualized call tree shows `vllm.config` accounts for the majority of the total import time at 55.5%. On this branch, `vllm.config`'s share decreased to 52.5%.

`python -c 'import vllm'` on a Google Compute Engine `a2-highgpu-1g` (12 vCPUs, 85 GB memory) instance with 1 A100 GPU: ~3% decrease in mean time

before (main branch commit 94d8ec8)
after (my PR commit 054f562)
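The mean-time comparison above can be reproduced with a small harness. This is a sketch under my own assumptions (the `runs` count and use of `sys.executable` are choices for illustration, not from the PR):

```python
import statistics
import subprocess
import sys
import time

def mean_import_time(module: str, runs: int = 5) -> float:
    """Average wall-clock time of `python -c 'import <module>'` over several runs."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        subprocess.run([sys.executable, "-c", f"import {module}"], check=True)
        samples.append(time.perf_counter() - start)
    return statistics.mean(samples)

# e.g. run this once on each checkout and compare:
# print(mean_import_time("vllm"))
```

Each sample spawns a fresh interpreter, so module caching in the parent process doesn't skew the numbers.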