This project implements a value-iteration-based multi-agent reinforcement learning algorithm for solving the Nash equilibrium problem in multi-agent systems.
The algorithm is based on the following key components:
- Value Iteration: Used to update Q-values and optimal policies for each agent.
- Actor-Critic Network: Approximates the value function and policy function.
- Gradient Clipping: Prevents gradient explosion problems.
- Adaptive Learning Rate: Decays over time to ensure the algorithm converges (a minimal sketch of these update mechanics follows this list).
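The snippet below is an illustrative sketch only: the feature matrix, targets, and constants are synthetic placeholders rather than values from the project code. It shows how a value-iteration-style critic update can be combined with gradient clipping and a decaying learning rate.

```matlab
% Illustrative sketch: all quantities are synthetic placeholders,
% not values or variable names from the project.
phi      = randn(8, 100);      % placeholder feature matrix (8 features, 100 samples)
target   = randn(1, 100);      % placeholder value-iteration targets
W        = zeros(8, 1);        % critic weight vector
alpha0   = 0.05;               % initial learning rate
clipNorm = 1.0;                % gradient clipping threshold

for k = 1:500
    alpha = alpha0 / (1 + 0.01 * k);       % adaptive learning rate (decays over time)
    err   = W' * phi - target;             % Bellman-style residual on the batch
    gradW = (phi * err') / size(phi, 2);   % gradient of the mean squared residual
    if norm(gradW) > clipNorm              % gradient clipping
        gradW = gradW * (clipNorm / norm(gradW));
    end
    W = W - alpha * gradW;                 % value-iteration style weight update
end
```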
The main functions are listed below (a conceptual sketch of the dynamics and tracking-error computations follows the list):
- `main_simulation()`: Main simulation loop
- `value_iteration()`: Performs value iteration updates
- `compute_Mi()`: Calculates the Mi matrix for each agent
- `actor_critic_network()`: Implements the Actor-Critic network
- `tracking_error()`: Computes tracking errors
- `system_dynamics()`: Simulates system dynamics
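As a rough picture of what `system_dynamics()` and `tracking_error()` compute, the sketch below steps a simple discrete-time model and measures each agent's deviation from a reference state. The matrices, dimensions, and variable names are assumptions made for illustration, not the project's actual model or interfaces.

```matlab
% Illustrative only: A, B, the agent states, and the reference are placeholders,
% not the project's actual multi-agent model.
A = [1 0.1; 0 1];                  % assumed per-agent state-transition matrix
B = [0; 0.1];                      % assumed input matrix
numAgents = 3;
x   = randn(2, numAgents);         % placeholder agent states (one column per agent)
u   = -0.5 * ones(1, numAgents);   % example control inputs
x_d = zeros(2, 1);                 % placeholder reference (desired) state

xNext = A * x + B * u;                      % one-step dynamics for all agents
e     = xNext - repmat(x_d, 1, numAgents);  % per-agent tracking error
disp(sqrt(sum(e.^2, 1)));                   % error magnitude for each agent
```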
Here are some key results from running the algorithm:
- Weight convergence: how the weights of the Critic and Actor networks change over time.
- Tracking errors: the tracking error of each agent over time.
- State trajectories: how the state of each agent evolves over time.
- Control inputs: how the control input of each agent changes over time.
- Ensure your MATLAB environment is properly configured.
- Run the `main_simulation()` function to start the simulation (see the example below).
- Results will be automatically saved in the `result` directory.
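Assuming `main_simulation()` takes no required input arguments (the actual signature may differ), a typical run from the MATLAB command window looks like this:

```matlab
% From the MATLAB command window, with the project root as the current folder:
addpath(genpath(pwd));   % make all project functions visible on the path
main_simulation();       % run the full simulation
% Generated plots and data should then appear in the result/ directory.
```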
- Algorithm performance may be affected by initial parameter settings.
- For large-scale systems, learning rates and iteration counts may need to be adjusted (see the sketch below).
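If tuning is needed, the relevant knobs typically sit near the top of the main script. The names below are hypothetical placeholders, not the project's actual variables; they only indicate the kind of values one might lower or raise for a larger system.

```matlab
% Hypothetical tuning parameters; the project's actual variable names may differ.
alpha0   = 0.01;    % smaller initial learning rate for a larger, stiffer system
maxIter  = 2000;    % more value-iteration sweeps to compensate for slower updates
clipNorm = 0.5;     % tighter gradient clipping if updates become unstable
```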
- Implement more complex reward functions
- Explore other types of Actor-Critic architectures
- Test algorithm performance on real physical systems