Markov Decision Processes: Multi Armed Bandit

This is a Multi-Armed Bandit Web App on Streamlit. This app was a part of my Talk on Markov Decision Processes.

Game Screenshots

Intro	Game Play (when hint is clicked)

Results (Optimal vs Player)	Results (Reveal Button Rewards)

Epsilon Greedy Simulation

Features

A Nice Intro Page with introduction and hints
Three-Armed Bandit Implementation which the player can try out on his own.
A fully functional Results page with comparison to the Optimal Performance and Arm's Reward Dynamics.
Epsilon Greedy Agent Implementation on a 3-Arm Testbed (or user chosen Arm Reward Distributions)

Usage

To run the WebApp, follow the instructions below,

# Clone Repo
git clone https://github.com/RishiDarkDevil/MDP-Multi-Armed-Bandit.git
cd MDP-Multi-Armed-Bandit

# Install Prerequitesites
pip install streamlit numpy pandas matplotlib scipy

# Run WebApp
streamlit run MAB.py

Note: For the Agents Page, after running the simulation if you want to change the parameters and run again, as of now first stop the running on the top right corner of streamlit and then set the parameters and run again (Needs to be fixed).

Upcoming

Host the Application, with a Leaderboard.
Allow the Host to change the Arm Dynamics which is currently hard-coded.
Make it more interesting with changing Reward Distribution (But not too complicated as humans will play)
Generalize to K-Arms.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
pages		pages
.DS_Store		.DS_Store
LICENSE		LICENSE
MAB.py		MAB.py
README.md		README.md
buttondist.png		buttondist.png
egreedy.png		egreedy.png
hints.png		hints.png
intro.png		intro.png
results.png		results.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Markov Decision Processes: Multi Armed Bandit

Game Screenshots

Epsilon Greedy Simulation

Features

Usage

Upcoming

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Markov Decision Processes: Multi Armed Bandit

Game Screenshots

Epsilon Greedy Simulation

Features

Usage

Upcoming

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages