Skip to content

Conversation

WoosukKwon
Copy link
Collaborator

@WoosukKwon WoosukKwon commented Jun 26, 2024

This PR implements Ray TPU executor for distributed inference support on TPU.

@WoosukKwon WoosukKwon added the tpu Related to Google TPUs label Jun 26, 2024
@WoosukKwon WoosukKwon changed the title [Hardware][TPU] Support tensor parallelism with Ray [Hardware][TPU] Implement tensor parallelism with Ray Jun 26, 2024
@WoosukKwon
Copy link
Collaborator Author

WoosukKwon commented Jun 26, 2024

For this PR, I will merge it after getting reviews. :)

The changes outside the TPU backend was reviewed in #6812 and #6813.

@WoosukKwon WoosukKwon added the ready ONLY add when PR is ready to merge/full CI is needed label Jul 21, 2024
@WoosukKwon WoosukKwon merged commit 52f07e3 into main Jul 27, 2024
@WoosukKwon WoosukKwon deleted the tpu-n branch July 27, 2024 03:54
Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024
LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Mar 26, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed tpu Related to Google TPUs

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant