rl_course

Course repo for the Reinforcement Learning course offered by Prof. Dimitri Bertsekas.

Spiders and Flies - Multi-Agent RL Problem

Using Base Policy

Starting from the given initial positions of the spiders and flies, the spiders following the base policy caught all the flies in 25 moves. The code to replicate this result can be found here

Using Standard Rollout

The standard rollout algorithm minimizes over the joint control space of $5^2 = 25$ controls at each state, using one-step lookahead minimization with the base policy as the terminal cost approximation. It took 16 moves to capture all the flies.
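The one-step lookahead described above can be sketched as follows. This is a minimal illustration, not the notebook's code: the grid size `GRID`, the move set `MOVES`, the stationary flies, and the greedy "step toward the nearest fly" base policy in `base_move` are all assumptions made for the example, and every function name is hypothetical.

```python
from itertools import product

# Assumed move set: stay, up, down, left, right (5 controls per spider).
MOVES = [(0, 0), (0, 1), (0, -1), (-1, 0), (1, 0)]
GRID = 10  # assumed side length of the square grid


def clip(p):
    """Keep a position inside the grid."""
    return (min(max(p[0], 0), GRID - 1), min(max(p[1], 0), GRID - 1))


def step(spiders, flies, joint_move):
    """Apply one joint move; flies are assumed stationary and are removed when caught."""
    spiders = tuple(clip((s[0] + m[0], s[1] + m[1])) for s, m in zip(spiders, joint_move))
    return spiders, frozenset(f for f in flies if f not in spiders)


def base_move(spider, flies):
    """Assumed base policy: move one cell toward the nearest fly (Manhattan distance)."""
    if not flies:
        return (0, 0)
    target = min(flies, key=lambda f: (abs(f[0] - spider[0]) + abs(f[1] - spider[1]), f))
    dx, dy = target[0] - spider[0], target[1] - spider[1]
    if dx != 0:
        return (1 if dx > 0 else -1, 0)
    if dy != 0:
        return (0, 1 if dy > 0 else -1)
    return (0, 0)


def base_cost(spiders, flies, horizon=200):
    """Number of joint moves the base policy needs to catch all flies."""
    cost = 0
    while flies and cost < horizon:
        spiders, flies = step(spiders, flies, tuple(base_move(s, flies) for s in spiders))
        cost += 1
    return cost


def standard_rollout_move(spiders, flies):
    """One-step lookahead over all 5^m joint moves, base policy as terminal cost."""
    best_q, best_joint = None, None
    for joint in product(MOVES, repeat=len(spiders)):
        nxt_spiders, nxt_flies = step(spiders, flies, joint)
        q = 1 + base_cost(nxt_spiders, nxt_flies)  # stage cost + cost-to-go estimate
        if best_q is None or q < best_q:
            best_q, best_joint = q, joint
    return best_joint


def run(policy_move, spiders, flies, max_moves=200):
    """Simulate a policy and return the number of joint moves until all flies are caught."""
    moves = 0
    while flies and moves < max_moves:
        spiders, flies = step(spiders, flies, policy_move(spiders, flies))
        moves += 1
    return moves
```

Calling `run(standard_rollout_move, spiders, flies)` with a tuple of spider positions and a `frozenset` of fly positions returns the number of joint moves under rollout, which can be compared against `base_cost(spiders, flies)`. The move counts will not match the README figures unless the initial positions and base policy match the notebook's.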

Using Multi-agent Rollout

In multi-agent rollout, only one agent moves at a time: each agent in turn performs a one-step lookahead minimization over its own controls, with the base policy as the terminal cost approximation. Because only one agent moves at a time, it took 33 moves in total to capture all the flies: 17 by spider 1 and 16 by spider 2.
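A rough sketch of the agent-by-agent minimization is given below. It follows the usual formulation in which, within one stage, each spider optimizes only its own 5 moves while earlier spiders keep their already chosen moves and later spiders follow the base policy, so a stage evaluates $5 \cdot 2 = 10$ Q-factors rather than $5^2 = 25$. The grid size, move set, stationary flies, and greedy base policy are assumptions, all names are hypothetical, and the sketch counts one stage as one move for both spiders, which may differ from how the notebook counts the 33 moves.

```python
# Assumed move set: stay, up, down, left, right (5 controls per spider).
MOVES = [(0, 0), (0, 1), (0, -1), (-1, 0), (1, 0)]
GRID = 10  # assumed side length of the square grid


def clip(p):
    """Keep a position inside the grid."""
    return (min(max(p[0], 0), GRID - 1), min(max(p[1], 0), GRID - 1))


def step(spiders, flies, joint_move):
    """Apply one joint move; flies are assumed stationary and are removed when caught."""
    spiders = tuple(clip((s[0] + m[0], s[1] + m[1])) for s, m in zip(spiders, joint_move))
    return spiders, frozenset(f for f in flies if f not in spiders)


def base_move(spider, flies):
    """Assumed base policy: move one cell toward the nearest fly (Manhattan distance)."""
    if not flies:
        return (0, 0)
    target = min(flies, key=lambda f: (abs(f[0] - spider[0]) + abs(f[1] - spider[1]), f))
    dx, dy = target[0] - spider[0], target[1] - spider[1]
    if dx != 0:
        return (1 if dx > 0 else -1, 0)
    if dy != 0:
        return (0, 1 if dy > 0 else -1)
    return (0, 0)


def base_cost(spiders, flies, horizon=200):
    """Number of joint moves the base policy needs to catch all flies."""
    cost = 0
    while flies and cost < horizon:
        spiders, flies = step(spiders, flies, tuple(base_move(s, flies) for s in spiders))
        cost += 1
    return cost


def multiagent_rollout_move(spiders, flies):
    """Choose moves one agent at a time: 5 * m Q-factors per stage instead of 5^m."""
    chosen = []
    for i in range(len(spiders)):
        best_q, best_m = None, None
        for m in MOVES:
            # Earlier agents: moves already chosen. Agent i: candidate m.
            # Later agents: assumed to follow the base policy.
            later = tuple(base_move(s, flies) for s in spiders[i + 1:])
            nxt_spiders, nxt_flies = step(spiders, flies, tuple(chosen) + (m,) + later)
            q = 1 + base_cost(nxt_spiders, nxt_flies)
            if best_q is None or q < best_q:
                best_q, best_m = q, m
        chosen.append(best_m)
    return tuple(chosen)


def run(policy_move, spiders, flies, max_moves=200):
    """Simulate a policy and return the number of stages until all flies are caught."""
    moves = 0
    while flies and moves < max_moves:
        spiders, flies = step(spiders, flies, policy_move(spiders, flies))
        moves += 1
    return moves
```

The design trade-off is that the per-stage search grows linearly in the number of spiders instead of exponentially, at the price of each agent optimizing against base-policy behavior for the agents that have not yet chosen.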
