Skip to content

Conversation

@WoosukKwon
Copy link
Collaborator

@WoosukKwon WoosukKwon commented May 7, 2023

This PR includes extensive refactoring of the system.

Major changes are:

  • Moved parallel_utils into model_executor
  • Moved simple_frontend to frontend
  • Moved gradio_webserver and test_cli_client to the root
  • Removed plot

@WoosukKwon WoosukKwon requested a review from zhuohan123 May 7, 2023 23:46
This was referenced May 8, 2023
@WoosukKwon WoosukKwon merged commit 7c041ab into main May 9, 2023
@WoosukKwon WoosukKwon deleted the refactor-arch branch May 9, 2023 22:30
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
yukavio pushed a commit to yukavio/vllm that referenced this pull request Jul 3, 2024
Fix repo link in setup.py
dllehr-amd pushed a commit to dllehr-amd/vllm that referenced this pull request Jul 22, 2024
…eation_before_each_gemm

Charlifu/avoid tensor creation before each gemm
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants