yingcanw
Collaborator

No description provided.

yingcanw and others added 14 commits December 24, 2024 03:48
* init v3 lite feat

* fix moe topk method

* fix noaux_tc logic

* fix deepseek v3 normal rope

* refactor

* wo conversion ok debugging build

* add quantize for attn.dense

* add unified converter support

* testing unified converter

* add convert checkpoint and update docs

---------

Co-authored-by: Zeyu Wang <[email protected]>
@nv-guomingz
Collaborator

LGTM

@nv-guomingz nv-guomingz merged commit f529c1c into NVIDIA:deepseek Jan 24, 2025
3 participants