LLM:RLHF alog and infra
Quantitative Invest
Deep Learning: time series data, image&video target detection, tracking and segmentation
-
alibaba-inc
- Beijing, China
Highlights
- Pro
Pinned Loading
-
alibaba/ROLL
alibaba/ROLL PublicAn Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.