[Hackathon 9th No.91] INT8*INT8 support for MoE GroupGEMM in FastDeploy #1164

WanRui37 · 2025-10-16T14:01:15Z

RFC for adding an INT8*INT8 implementation to MoE GroupGEMM in FastDeploy

paddle-bot · 2025-10-16T14:01:21Z

Your PR has been submitted. Thanks for your contribution!
Please check its format and content. For this, you can refer to Template and Demo.

ckl117 · 2025-10-23T06:52:23Z

rfcs/FastDeploy/20251016_FastDeploy_add_moe_groupgemm_int8_int8.md

+- At present, no `MoE GroupGEMM` implementation in the industry supports `INT8*INT8`
+
+# 4. Design Approach and Implementation Plan
+1. Some reference code paths


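For a quick implementation, you can refer to FD's existing wfp8afp8 Triton operator, and also take a look at how vLLM and TensorRT-LLM implement this. Either a CUDA or a Triton implementation is acceptable. Once the operator itself is done, you could go further and add operator fusion on top of it (for example, fusing the shared-expert layer into the MoE for GLM4.5-AIR).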
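As a point of reference for correctness checks, the semantics such an operator would need to reproduce can be sketched in plain Python. This is only a minimal sketch under assumptions that are not stated in the RFC: symmetric per-token activation scales, per-output-channel weight scales, and tokens already sorted by expert. The function names `quantize_rows_int8` and `moe_group_gemm_int8` are hypothetical and are not FastDeploy, vLLM, or TensorRT-LLM APIs.

```python
from typing import List, Tuple

def quantize_rows_int8(x: List[List[float]]) -> Tuple[List[List[int]], List[float]]:
    """Symmetric per-row INT8 quantization (assumed scheme).

    Returns the quantized rows and one float scale per row, so that
    x[i][j] is approximately q[i][j] * scales[i].
    """
    q, scales = [], []
    for row in x:
        amax = max((abs(v) for v in row), default=0.0)
        scale = amax / 127.0 if amax > 0 else 1.0
        q.append([max(-127, min(127, round(v / scale))) for v in row])
        scales.append(scale)
    return q, scales

def moe_group_gemm_int8(
    x_q: List[List[int]],          # [num_tokens][K], tokens sorted by expert
    x_scales: List[float],         # [num_tokens], per-token scales
    w_q: List[List[List[int]]],    # [num_experts][N][K], INT8 weights
    w_scales: List[List[float]],   # [num_experts][N], per-channel scales
    tokens_per_expert: List[int],  # [num_experts], rows owned by each expert
) -> List[List[float]]:
    """Reference grouped GEMM: each expert multiplies only its own token rows."""
    out = []
    row = 0
    for e, count in enumerate(tokens_per_expert):
        for t in range(row, row + count):
            out_row = []
            for n, w_row in enumerate(w_q[e]):
                # Integer dot product; a real kernel would accumulate in INT32.
                acc = sum(a * b for a, b in zip(x_q[t], w_row))
                # Dequantize with the activation and weight scales.
                out_row.append(acc * x_scales[t] * w_scales[e][n])
            out.append(out_row)
        row += count
    return out
```

A Triton or CUDA kernel could then be compared against this reference on small shapes, with a tolerance that accounts for the quantization error.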

Thanks a lot!

ckl117 · 2025-10-27T06:08:57Z

@WanRui37 If there are no further changes, shall I go ahead and merge this?

WanRui37 · 2025-10-28T09:58:26Z

@ckl117 Sorry, there are still more changes to come, and my code is not fully finished yet. Could this be merged later instead?

WanRui37 and others added 4 commits October 16, 2025 16:11

v1: Simply fill in the RFC

972d105

v1: Simply fill in the RFC

19a1c75

Merge branch 'PaddlePaddle:master' into rfc_002

c5b871c

v1: Simply fill in the RFC

5a34a4e

paddle-bot bot added the contributor label Oct 16, 2025

luotao1 mentioned this pull request Oct 17, 2025

[Hackathon 9th] Open Source Contribution Individual Challenge PaddlePaddle/Paddle#74773

Open

luotao1 assigned luotao1 and ckl117 Oct 17, 2025

WanRui37 and others added 3 commits October 22, 2025 21:48

v2: Added some design ideas

09b6eb3

Merge branch 'PaddlePaddle:master' into rfc_002

e235203

v2: Added some design ideas

754593c

ckl117 reviewed Oct 23, 2025
