Skip to content

Conversation

@suquark
Copy link
Contributor

@suquark suquark commented Apr 16, 2023

Bug fixed and ready to run version

@zhuohan123
Copy link
Member

Close this PR since it's too diverged from the current main.

@zhuohan123 zhuohan123 closed this Jun 17, 2023
@zhuohan123 zhuohan123 deleted the prefix_siyuan branch June 18, 2023 07:30
@huangtingwei9988
Copy link

@suquark hi,Will it work properly when you complete this PR?

fxmarty pushed a commit to fxmarty/vllm-public that referenced this pull request Jun 12, 2024
joerunde added a commit to joerunde/vllm that referenced this pull request Jun 17, 2024
This includes some fixes for supporting vllm 0.4.3+.

Mostly the `generate` api changed, so we have to update our grpc server
accordingly

---------

Signed-off-by: Joe Runde <[email protected]>
yukavio pushed a commit to yukavio/vllm that referenced this pull request Jul 3, 2024
@alixiaodi alixiaodi mentioned this pull request Aug 2, 2024
bigPYJ1151 added a commit to bigPYJ1151/vllm that referenced this pull request Aug 8, 2024
* fix rope

* add warming-up

* add shm gather
heheda12345 pushed a commit to heheda12345/vllm that referenced this pull request Sep 29, 2025
* Squashed commit of lwilkinson/decode-only changes relative to origin/dev

Co-authored-by: Matthew Bonanni <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>

* update FlashMLA

Signed-off-by: Lucas Wilkinson <[email protected]>

* fix non-spec error

Signed-off-by: Lucas Wilkinson <[email protected]>

---------

Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: Chen Zhang <[email protected]>
Co-authored-by: Matthew Bonanni <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants