The aim of this project is learned communication between cooperative agents in a multi-agent reinforcement learning setting. Using PPO and TorchRL, agents with and without communication are compared in the Level-Based Foraging environment.
The goal is to implement and test a communication channel in this environment and to measure how it helps the agents cooperate on the task.
- Environment:
- LevelBasedForaging-v0 (2 agents)
- Algorithms:
- MAPPO (centralized critic)
- IPPO (independent critics)
- Communication:
- Discrete symbols from a fixed vocabulary; messages emitted at one time step are appended to observations at the next time step
Three configurations are tested with both MAPPO and IPPO:
- No Communication
- Communication with a vocabulary size of 4
- Communication with a vocabulary size of 1 (a control: a single-symbol vocabulary can carry no information, so it should behave like no communication)
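The message-passing scheme described above can be sketched as follows. This is an illustrative sketch, not the project's actual code: the function name, observation shapes, and one-hot message encoding are assumptions.

```python
import numpy as np

VOCAB_SIZE = 4  # fixed message vocabulary (assumed size)

def augment_observations(obs, prev_messages, vocab_size=VOCAB_SIZE):
    """Append each agent's received message (one-hot) to its observation.

    With two agents, messages are swapped: agent 0 receives the token
    agent 1 emitted on the previous step, and vice versa.
    """
    n_agents = len(obs)
    augmented = []
    for i in range(n_agents):
        # agent-to-agent swap: read the other agent's previous token
        sender = (i + 1) % n_agents
        one_hot = np.zeros(vocab_size, dtype=np.float32)
        one_hot[prev_messages[sender]] = 1.0
        augmented.append(np.concatenate([obs[i], one_hot]))
    return augmented
```

With a vocabulary size of 1, the appended one-hot vector is always the same, which is why that configuration serves as a control against no communication.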
- Multi-agent PPO training loop using TorchRL
- Discrete communication channel
- Message passing via agent-to-agent swapping
- Centralized and decentralized critics
- Logging and visualization of:
- Episode reward curves
- Message entropy over training
- Message token usage distributions
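A minimal sketch of the message-entropy metric listed above, assuming entropy is computed over the empirical token usage distribution of a batch of emitted messages (the function name and bits-based units are illustrative choices, not taken from the project):

```python
import numpy as np

def message_entropy(tokens, vocab_size):
    """Shannon entropy (in bits) of the empirical token usage distribution.

    tokens: sequence of emitted message token ids in [0, vocab_size).
    Low entropy means the agents collapse onto a few symbols; the maximum,
    log2(vocab_size), means all symbols are used uniformly.
    """
    counts = np.bincount(tokens, minlength=vocab_size).astype(np.float64)
    probs = counts / counts.sum()
    nonzero = probs[probs > 0]  # 0 * log(0) is treated as 0
    return float(-(nonzero * np.log2(nonzero)).sum())
```

Tracking this quantity over training shows whether a protocol is emerging (entropy above zero but below the uniform maximum) or whether messages stay uninformative.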
The experiments show that the agents did not improve at all with communication, either during or after training. This is possibly due to an undetected fault in the code that breaks the learning process.
Currently, because of some limitations, the notebook can only be run in Google Colab.