This repository contains code corresponding to the work titled: "Extending Iterated, Spatialized Prisoners’ Dilemma to Understand Multicellularity: game theory with self-scaling players"
The general idea is to run a simulation in which independent RL agents play iterated prisoner's dilemma (IPD) games with one another, with a twist: in addition to the usual cooperate and defect actions, agents also have the option to merge or split. These operators let an agent forgo its individuality in favor of another, better-performing individual, with the agents' 2D spatial arrangement growing or shrinking accordingly.
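As background, a single IPD round between two agents can be sketched as follows. This is an illustrative sketch only: the payoff values shown are the textbook defaults, and the merge ("M") / split ("S") action labels are hypothetical names for the extra operators the README describes; the repository's actual values and mechanics live in its simulation code.

```python
# Standard PD payoffs: T (temptation) > R (reward) > P (punishment) > S (sucker).
# These are textbook values, not necessarily the ones used in this repository.
PAYOFFS = {
    ("C", "C"): (3, 3),  # mutual cooperation
    ("C", "D"): (0, 5),  # sucker vs. temptation
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),  # mutual defection
}

# In ipd-ms, the action set is extended with merge/split operators
# (labels "M"/"S" are hypothetical here).
ACTIONS = ["C", "D", "M", "S"]

def play_round(a1: str, a2: str) -> tuple[int, int]:
    """Resolve one IPD round between two cooperate/defect actions."""
    return PAYOFFS[(a1, a2)]
```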
Here's a visualization (the colorbar represents agent size):
The interactive demo may give a better feel for these dynamics.
In a virtualenv:

```shell
pip3 install -r metadata/requirements.txt
```

A simulation of merge-based IPD (termed ipd-ms) can be run as:
```shell
python3 ipd-ms.py --mode "fixed" --mem_len 4 -bs 20
```

and,
A simulation of classic IPD (with cooperate and defect as possible actions) can be run as:
```shell
python3 ipd.py --mode "fixed" --mem_len 4 -bs 20
```

These hyperparameters require some background on the design of our RL agents:
- Each RL agent carries two data structures: a list of its memories (the actions played so far) and a policy table (a discrete map from memory states to actions).
- The memory size (the length of the list) can be pre-set to a fixed capacity (the --mem_len hyperparameter).
- Since each simulation involves multiple agents (determined by the -bs hyperparameter), the --mode hyperparameter controls how each agent's --mem_len is set.
Specifically, in the commands above:
- The --mode option specifies whether agents are uniformly assigned a constant memory capacity (mode = fixed) or heterogeneously assigned capacities drawn from a uniform random distribution (mode = range_memory) whose upper bound is the --mem_len parameter (here, --mem_len = 4).
- The -bs option specifies the side length of the agent grid. Here it is set to 20, implying a 20x20 grid of 400 agents.
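The agent design and the two memory modes described above can be sketched as follows. This is a hypothetical illustration: the class and function names (Agent, init_agents) mirror the README's description, not the repository's actual code.

```python
import random
from collections import deque

class Agent:
    """Sketch of an RL agent: a bounded memory plus a policy table."""
    def __init__(self, mem_len: int):
        self.memory = deque(maxlen=mem_len)  # actions played so far
        self.policy = {}                     # memory state -> action

def init_agents(bs: int, mode: str, mem_len: int) -> list[Agent]:
    """Create a bs x bs grid of agents with mode-dependent memory capacities."""
    n = bs * bs
    if mode == "fixed":
        sizes = [mem_len] * n  # every agent gets the same capacity
    elif mode == "range_memory":
        # heterogeneous capacities, uniformly sampled up to mem_len
        sizes = [random.randint(1, mem_len) for _ in range(n)]
    else:
        raise ValueError(f"unknown mode: {mode}")
    return [Agent(s) for s in sizes]
```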
The file hyperparams.py maintains the invariant hyperparameter values related to the Q-learning algorithm, mutation rate, etc.
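For reference, the tabular Q-learning update such agents typically perform looks like the sketch below. The learning rate (alpha) and discount (gamma) shown are placeholder values, standing in for whatever constants hyperparams.py actually fixes.

```python
def q_update(q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """One tabular Q-learning step:
    Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a)).
    alpha/gamma here are illustrative defaults, not the repo's values."""
    best_next = max(q.get((next_state, a), 0.0) for a in actions)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q
```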
The results we report focus on the relationship between memory size (varied via the --mem_len option) and merge tendency (varied by enabling or disabling the merge/split actions, i.e., by running either ipd-ms or ipd simulations).
Running plot.sh replicates our results:

```shell
cd metadata
chmod +x plot.sh
./plot.sh
```

Note: plot.sh fetches cached simulation data (running the simulations from scratch is time-consuming). Re-running them should yield approximately the same results.
