diff --git a/README.md b/README.md
index 7d3bfc14d4c..20fa6950ae7 100644
--- a/README.md
+++ b/README.md
@@ -227,3 +227,4 @@ To get started with TensorRT-LLM, visit our documentation:
 - [Quantized models on Hugging Face](https://huggingface.co/collections/nvidia/model-optimizer-66aa84f7966b3150262481a4): A growing collection of quantized (e.g., FP8, FP4) and optimized LLMs, including [DeepSeek FP4](https://huggingface.co/nvidia/DeepSeek-R1-FP4), ready for fast inference with TensorRT-LLM.
 - [NVIDIA Dynamo](https://github.com/ai-dynamo/dynamo): A datacenter scale distributed inference serving framework that works seamlessly with TensorRT-LLM.
 - [AutoDeploy](./examples/auto_deploy/README.md): An experimental backend for TensorRT-LLM to simplify and accelerate the deployment of PyTorch models.
+- [WeChat Discussion Group](https://github.com/NVIDIA/TensorRT-LLM/issues/5359): A real-time channel for TensorRT-LLM Q&A and news.
diff --git a/docs/source/media/Wechat_Group_QR_Code.png b/docs/source/media/Wechat_Group_QR_Code.png
new file mode 100644
index 00000000000..9fa3a37c3ba
Binary files /dev/null and b/docs/source/media/Wechat_Group_QR_Code.png differ