Skip to content

Releases: leggedrobotics/rsl_rl

v3.1.0

18 Sep 08:05
Compare
Choose a tag to compare

Overview

Full Changelog: v3.0.1...v3.1.0

Added

  • Adds state-dependent standard deviation for the PPO actor by @iakinola23 in #112

Fixed

New Contributors

v3.0.1

01 Sep 13:02
2fc1f78
Compare
Choose a tag to compare

Overview

Full Changelog: v3.0.0...v3.0.1

Fixed

v3.0.0

18 Jul 09:46
Compare
Choose a tag to compare

Overview

RSL RL now supports observation dictionaries using the TensorDict library. Different observation groups with different shapes can thus be handled seemlessly, e.g., vision inputs. To assign different observation groups to the correct part of the policy, a dictionary maps

Additionally, the code has been refactored to be more modular and flexible. The main changes are:

  • An additional runner class for student-teacher distillation
  • An MLP class that can be used to build custom policies
  • Normalization is now part of the policy and can be set for different parts, e.g., actor and critic, seperately.

Full Changelog: v2.3.3...v3.0.0

Added

Breaking Changes

  • Isaac Lab does not yet support the new observation handling. There is an open PR (isaac-sim/IsaacLab#2962) that can be used until the changes are merged.

New Contributors

v2.3.3

30 May 10:46
750e845
Compare
Choose a tag to compare

Overview

Full Changelog: v2.3.2...v2.3.3

Fixed

v2.3.2

14 May 16:15
Compare
Choose a tag to compare

Overview

Full Changelog: v2.3.1...v2.3.2

Added

Fixed

  • Fixes unexpected keyword argument learning_rate within RandomNetworkDistillation by @ozhanozen in #87

New Contributors

v2.3.1

09 Apr 07:37
d38a378
Compare
Choose a tag to compare

Overview

Full Changelog: v2.3.0...v2.3.1

Added

Fixed

  • Fixes git repository code storage function by @Mayankm96 in #83
  • Fixes padding shape in split_and_pad_trajectories to support arbitrary additional dimensions by @bikcrum in #77
  • Disable distribution mean gradient propagation into action noise std for StudentTeacher by @flferretti in #82

New Contributors

v2.3.0

26 Mar 15:58
Compare
Choose a tag to compare

Overview

RSL RL now supports distributed training. Additionally, a new distillation algorithm allows for student-teacher training.

Full Changelog: v2.2.4...v2.3.0

Added

Fixed

Breaking Changes

  • Renamed actor_critic to policy to be more general and align with other architectures and algorithms by @ClemensSchwarke in bbce4ef

v2.2.4

05 Mar 11:56
Compare
Choose a tag to compare

Overview

Full Changelog: v2.2.3...v2.2.4

Fixed

New Contributors

v2.2.3

25 Feb 18:27
Compare
Choose a tag to compare

Overview

This release adds some new parameters to PPO which help make the training more stable.

Full Changelog: v2.2.2...v2.2.3

Added

  • Adds flag for per-batch advantage normalization by @Mayankm96 in #68
  • Adds support for log-std parameter in ActorCritic by @Mayankm96 in #67

Fixed

  • Fixes mean_entropy logging by dividing by num_updates by @bikcrum in #65
  • Corrects disabling of arguments when creating Normal distribution by @Mayankm96 in #69

New Contributors

v2.2.2

24 Feb 18:00
Compare
Choose a tag to compare

Overview

Full Changelog: v2.2.1...v2.2.2

Fixed

  • Fixes bug in ActorCriticRecurrent hidden state reset by @jnskkmhr in #50
  • Stops gradient propagation through ActorCritic std-dev by @Mayankm96 in #66
  • Removes unused attributes from VecEnv in 8818338
  • Fixes weight schedule dict for RND in 6909a47

New Contributors