Skip to content

Conversation

@qgallouedec
Copy link
Member

vLLM has proven to be unstable, frequently breaking TRL even with patch releases. The upcoming vLLM 0.11 (not released at the time of writing) is expected to break GuidedDecodingParams (see vllm-project/vllm#22772) without prior notice or deprecation. It will also include "many other" brekaking changes, quote a vLLM maintainer.

To mitigate this and ensure stability for TRL users, we propose a new approach for managing the vLLM dependency specifically: explicitly specify the supported versions and assume that any newer release—even a patch—may break the codebase. The list of supported versions will only be extended once the community has validated that the new version does not break, or introduce regressions.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@qgallouedec qgallouedec changed the title Pin vLLM version 📌 Pin vLLM version Sep 23, 2025
Copy link
Member

@albertvillanova albertvillanova left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good decision. I remember doing something similar for fsspec dependency in datasets: this approach saves us from having to release a hotfix after one of their breaking releases.

@qgallouedec qgallouedec merged commit 251fdb2 into main Sep 23, 2025
9 of 12 checks passed
@qgallouedec qgallouedec deleted the pin-vllm branch September 23, 2025 14:02
qgallouedec added a commit that referenced this pull request Sep 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants