Class 01 - Q&A! #25
Replies: 4 comments 7 replies
-
2 great videos on a bit of motivation for the physics as an optimization topic: |
Beta Was this translation helpful? Give feedback.
-
We had a great question after the first class from someone with an (NN-Based) RL background:
My understanding of their question: I presented the multi-stage problem as an expanding tree where you have to both consider the feasibility of your decisions and cleverly manage how to tackle the impact of future (uncertain) stages: In (Deep) RL literature, Constrained RL methods deal with state constraints using modified versions of the penalty method we will mention in the 2nd lecture. I am sure there must be clever (yet complex) methods that can converge for a few special cases, but my interpretation of their question is that there is this overall idea among some people that you can just use slightly modified versions of classical RL methods to solve any constrained multi-stage problem. It would be great to show when penalty methods fail or even when they take incredibly longer to converge than the alternative. |
Beta Was this translation helpful? Give feedback.
-
Another question we had after the first class came from someone with an optimization background: For mechanical systems in which the Lagrangian is given by where The student posed:
|
Beta Was this translation helpful? Give feedback.
-
Can you describe how to derive the coordinate transformation in the unicycle example? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
👋 Welcome!
We’re using Discussions as a place to:
build together 💪.
Beta Was this translation helpful? Give feedback.
All reactions