Skip to content

Commit 5926c13

Browse files
committed
Add Efficient Online Training with GRPO and vLLM in TRL recipe
1 parent e0f66fe commit 5926c13

File tree

3 files changed

+2488
-3
lines changed

3 files changed

+2488
-3
lines changed

notebooks/en/_toctree.yml

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -88,8 +88,9 @@
8888
title: Hyperparameter Optimization with Optuna and Transformers
8989
- local: function_calling_fine_tuning_llms_on_xlam
9090
title: Fine-tuning LLMs for Function Calling with the xLAM Dataset
91-
92-
91+
- local: grpo_vllm_online_training
92+
title: Efficient Online Training with GRPO and vLLM in TRL
93+
9394

9495
- title: Computer Vision Recipes
9596
isExpanded: false

notebooks/en/grpo_vllm_online_training.ipynb

Lines changed: 2484 additions & 0 deletions
Large diffs are not rendered by default.

notebooks/en/index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,11 +7,11 @@ applications and solving various machine learning tasks using open-source tools
77

88
Check out the recently added notebooks:
99

10+
- [Efficient Online Training with GRPO and vLLM in TRL](grpo_vllm_online_training)
1011
- [Fine-tuning LLMs for Function Calling with the xLAM Dataset](function_calling_fine_tuning_llms_on_xlam)
1112
- [Post training an VLM for reasoning with GRPO using TRL](fine_tuning_vlm_grpo_trl)
1213
- [TRL GRPO Reasoning with Advanced Reward](trl_grpo_reasoning_advanced_reward)
1314
- [Fine-Tuning a Vision Language Model with TRL using MPO](fine_tuning_vlm_mpo)
14-
- [Fine tuning a VLM for Object Detection Grounding using TRL](fine_tuning_vlm_object_detection_grounding)
1515

1616
You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).
1717

0 commit comments

Comments
 (0)