This is a repository for implementing RL basic code from David Silver's https://www.davidsilver.uk/teaching/
- GLIE Monte Carlo Control
- SARSA
- n-step SARSA
- backward SARSA
- Q-Learning
- GLIE Monte Carlo Control Value Function Approximation
- SARSA Value Function Approximation
- n-step SARSA Value Function Approximation
- backward SARSA Value Function Approximation
- Q-Learning Value Function Approximation
- Monte Carlo Control Policy
- Action Value Actor Critic
Plot maze with Dyna-Q Algorithm
8ILP_cute.py : maze with cute ver
8ILP_color.py : maze with color ver