Projects to start with in RL
- OpenAI Gym
- Simple Bandit
- Tracking Bandit
- UCB Bandit
- Gradient Bandit
- Iterative Policy Evaluation
- Policy Iteration
- Value Iteration
- Monte Carlo method
- Temporal-Difference method
- Policy Gradient method
- On-policy method
- Off-policy method
- Deep RL
- Cart-pole
- Frozen lake
- Maze solver
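
As a possible first step for the Simple Bandit item in the list above, here is a minimal sketch of an epsilon-greedy agent with sample-average value estimates. The function name `run_simple_bandit`, its parameters, and the unit-variance Gaussian rewards are assumptions made for illustration; they do not come from this list or from any particular library.

```python
import random


def run_simple_bandit(true_means, steps=2000, epsilon=0.1, seed=0):
    """Epsilon-greedy k-armed bandit with sample-average updates.

    true_means: hypothetical mean reward of each arm (illustrative only).
    Returns the estimated values q and the pull counts n per arm.
    """
    rng = random.Random(seed)   # local generator so runs are reproducible
    k = len(true_means)
    q = [0.0] * k               # estimated value of each arm
    n = [0] * k                 # number of times each arm was pulled

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            a = rng.randrange(k)
        else:
            # Exploit: pick the arm with the highest current estimate
            # (ties go to the lowest index).
            a = max(range(k), key=lambda i: q[i])

        # Assumed reward model: Gaussian noise with unit variance.
        reward = rng.gauss(true_means[a], 1.0)

        # Incremental sample-average update: Q <- Q + (R - Q) / N.
        n[a] += 1
        q[a] += (reward - q[a]) / n[a]

    return q, n


if __name__ == "__main__":
    estimates, counts = run_simple_bandit([0.0, 0.5, 2.0])
    print("estimated values:", [round(v, 2) for v in estimates])
    print("pull counts:", counts)
```

Replacing the `1 / n[a]` step size with a constant gives a version that tracks non-stationary rewards, which may be a natural route to the Tracking Bandit item.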