Skip to content

Conversation

@behroozazarkhalili
Copy link
Collaborator

This PR moves PRMTrainer and PRMConfig to the experimental module as part of the TRL V1 refactoring effort.

Changes

  • Create trl.experimental.prm module with PRMTrainer and PRMConfig
  • Add deprecation stubs in trl.trainer with FutureWarning (removal in TRL 0.29.0)
  • Update imports in tests, examples, and documentation
  • Move PRM to Experimental section in docs/_toctree.yml

Migration pattern

Follows the same pattern used for PPO migration.

Contributes to #4374
Fixes #4467

- Create trl.experimental.prm module with PRMTrainer and PRMConfig
- Add deprecation stubs in trl.trainer with FutureWarning (removal in TRL 0.29.0)
- Update imports in tests, examples, and documentation
- Move PRM to Experimental section in docs/_toctree.yml

Contributes to #4374
Fixes #4467
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

- Updated trainer references in dataset_formats.md and example_overview.md
- Moved test file to tests/experimental/ directory
- Updated test imports from relative to parent directory
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Move PRMTrainer to trl.experimental

3 participants