Skip to content

Conversation

yingcanw
Copy link
Collaborator

@yingcanw yingcanw commented Jan 2, 2025

No description provided.

yingcanw and others added 8 commits December 24, 2024 03:48
* init v3 lite feat

* fix moe topk method

* fix noaux_tc logic

* fix deepseek v3 normal rope

* refactor

* wo conversion ok debugging build

* add quantize for attn.dense

* add unified converter support

* testing unified converter

* add convert checkpoint and update docs

---------

Co-authored-by: Zeyu Wang <[email protected]>
@yingcanw yingcanw merged commit 718ef13 into NVIDIA:deepseek Jan 2, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants