none0663/README.md
  • 👋 Hi, I’m @none0663
  • 🧠 Reinforcement Learning Specialist with 6 years of hands-on experience
  • ⚡ Interests: RL algorithm design & RLHF (Reinforcement Learning from Human Feedback)
  • 🤝 Seeking collaboration: Open-source RL toolkits, human-AI alignment projects, and novel RLHF applications
  • 📫 Let's connect: [email protected]
  • 💡 Fun fact: Has trained RL agents both with classic RL algorithms and with RLHF
  • 🌱 Always learning: Latest papers on reinforcement learning and ethical AI alignment

Popular repositories

  1. none0663 (Public): Config files for my GitHub profile.

  2. PARL (Public, forked from PaddlePaddle/PARL): A high-performance distributed training framework for Reinforcement Learning. Python.

  3. verl (Public, forked from volcengine/verl): veRL: Volcano Engine Reinforcement Learning for LLM. Python.

  4. OpenRLHF (Public, forked from OpenRLHF/OpenRLHF): An easy-to-use, scalable, and high-performance RLHF framework (70B+ PPO full tuning, iterative DPO, LoRA, RingAttention, and RFT). Python.

  5. Megatron-LM (Public, forked from NVIDIA/Megatron-LM): Ongoing research training transformer models at scale. Python.