- 👋 Hi, I’m @none0663
- 🧠 Reinforcement Learning Specialist with 6 years of hands-on experience
- ⚡ Interests: RL algorithms design & RLHF (Reinforcement Learning from Human Feedback)
- 🤝 Seeking collaboration: Open-source RL toolkits, human-AI alignment projects, and novel RLHF applications
- 📫 Let's connect: [email protected]
- 💡 Fun fact: Trained an RL agent and RLHF
- 🌱 Always learning: Latest papers on reinforcement learning and ethical AI alignment
Popular repositories Loading
-
-
PARL
PARL PublicForked from PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
Python
-
verl
verl PublicForked from volcengine/verl
veRL: Volcano Engine Reinforcement Learning for LLM
Python 1
-
OpenRLHF
OpenRLHF PublicForked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python
-
Megatron-LM
Megatron-LM PublicForked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python
If the problem persists, check the GitHub status page or contact support.