Skip to content

ghimiremukesh/rl_course

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

rl_course

Course Repo for Reinforcement Learning Offered by Prof. Dmitri Bertsekas

Spider and Flies - Multi-Agent RL Problem

problem

Using Base Policy

Using base policy with the given starting point of the spiders and flies, the spiders caught the flies in 25 moves. The code to replicate can be found here

gif

Using Standard Rollout

The Standard Rollout algorithm minimizes over $5^2$ joint control space at each state using one-step look-ahead minimization and terminal cost approximation using base policy. It took 16 moves to capture all the flies.

gif_std

Using Multi-agent Rollout

In Multi-agent Rollout, only one agent moves at a time and performs one-step look-ahead minimization and terminal cost approximation using base policy. Using multi-agent rollout, it took 33 joint moves as only one agent moves at a time. Invidual spider moves were 17 and 16 for spider 1 and 2 respectively.

gif2

About

Course Repo for Reinforcement Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published