Skip to content

Conversation

@quic-shagun
Copy link
Contributor

This PR adds support for zai-org/GLM-4.5-Air model.
Open source MoE model with performance and accuracy better than many closed source models:

image

@quic-rishinr
Copy link
Contributor

quic-rishinr commented Nov 18, 2025

@shagsood do we have approval for this model? also do add this model under validated model list

@quic-rishinr
Copy link
Contributor

@vbaddi can you please review this PR?

@quic-rishinr quic-rishinr requested a review from vbaddi November 18, 2025 09:17
@quic-sgunnala
Copy link

@shagsood do we have approval for this model? also do add this model under validated model list

Yes we have legal approval for this model.


class QEffGlm4MoeMoE(Glm4MoeMoE):
"""
MoE Block
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: We can start using our optimized moe block for prefill/decode usecase here?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants