Add Efficient Online Training with GRPO and vLLM in TRL recipe

sergiopaniego · sergiopaniego · commit 5926c1364682 · 2025-10-01T18:07:07.000+02:00
diff --git a/notebooks/en/_toctree.yml b/notebooks/en/_toctree.yml
@@ -88,8 +88,9 @@
           title: Hyperparameter Optimization with Optuna and Transformers
         - local: function_calling_fine_tuning_llms_on_xlam
           title: Fine-tuning LLMs for Function Calling with the xLAM Dataset
-          
-        
+        - local: grpo_vllm_online_training
+          title: Efficient Online Training with GRPO and vLLM in TRL
+
           
     - title: Computer Vision Recipes
       isExpanded: false
diff --git a/notebooks/en/grpo_vllm_online_training.ipynb b/notebooks/en/grpo_vllm_online_training.ipynb
diff --git a/notebooks/en/index.md b/notebooks/en/index.md
@@ -7,11 +7,11 @@ applications and solving various machine learning tasks using open-source tools
 
 Check out the recently added notebooks:
 
+- [Efficient Online Training with GRPO and vLLM in TRL](grpo_vllm_online_training)
 - [Fine-tuning LLMs for Function Calling with the xLAM Dataset](function_calling_fine_tuning_llms_on_xlam)
 - [Post training an VLM for reasoning with GRPO using TRL](fine_tuning_vlm_grpo_trl)
 - [TRL GRPO Reasoning with Advanced Reward](trl_grpo_reasoning_advanced_reward)
 - [Fine-Tuning a Vision Language Model with TRL using MPO](fine_tuning_vlm_mpo)
-- [Fine tuning a VLM for Object Detection Grounding using TRL](fine_tuning_vlm_object_detection_grounding)
 
 You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).