Pinned
-
Trinity-RFT (Public, forked from modelscope/Trinity-RFT)
Trinity-RFT is a general-purpose, flexible, and scalable framework for reinforcement fine-tuning (RFT) of large language models (LLMs).
Python
-
modelscope/data-juicer (Public)
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
-
FederatedScope (Public, forked from alibaba/FederatedScope)
An easy-to-use federated learning platform
Python
-
RFT-Math-Eval (Public, forked from LeapLabTHU/limit-of-RLVR)
Repository for the paper https://arxiv.org/abs/2504.13837
Python