Pull requests: huggingface/trl

Pull requests list

Add GRPO Wordle OpenEnv Colab
#4542 opened Nov 18, 2025 by sergiopaniego
5 tasks
[OpenEnv] browsergym example script
#4539 opened Nov 18, 2025 by kashif
5 tasks
Add target_parameters to LoraConfig
#4536 opened Nov 18, 2025 by jonnyli1125
5 tasks
Add compute_metrics parameter for GRPOTrainer
#4534 opened Nov 17, 2025 by colinzhaoxp
Add OpenEnv Script examples to docs
#4533 opened Nov 17, 2025 by sergiopaniego
5 tasks
[GRPO] Sequence-level TIS & MIS
#4530 opened Nov 16, 2025 by LeonEricsson
5 tasks
Add Qwen3VLGRPOTrainer for Qwen3-VL GRPO training
#4529 opened Nov 16, 2025 by NDNM1408
Make skip_special_tokens configurable
#4521 opened Nov 13, 2025 by taha-yassine
3 of 5 tasks
fix tokenize bug for ppo_tldr example
#4520 opened Nov 13, 2025 by kaixuanliu
[GRPO] switch grpo liger loss to triton version
#4519 opened Nov 13, 2025 by kashif Draft
5 tasks
adding [SimPER](https://arxiv.org/abs/2502.00883)
#4486 opened Nov 6, 2025 by leeparkuky
2 of 5 tasks
Add attention_mask to signature_columns
#4459 opened Nov 5, 2025 by shubhamjain0594
5 tasks
added 10 papers (+trainer cross-links) for #4407
#4441 opened Nov 3, 2025 by SSusantAchary
4 tasks done
docs: Expand training customization examples
#4427 opened Nov 2, 2025 by behroozazarkhalili
4 tasks done