SimpleMethod

This is a fairly straightforward method: I simply instantiate a weight matrix, multiply it with the states to get rewards over a period, and then try to maximise the reward. It is meant for discrete state and action spaces.
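
The idea can be sketched roughly as follows. This is only a minimal illustration, not the code in this folder. It makes several assumptions: `ToyBalanceEnv` is a hypothetical stand-in environment (so the snippet runs without Gym), the action is chosen as the argmax of `state @ weights`, and plain random search is used as the way to "maximise the reward", since the README does not say which search is actually used.

```python
import numpy as np


class ToyBalanceEnv:
    """Hypothetical stand-in for CartPole: keep a sliding point near the origin."""

    def __init__(self, max_steps=200, seed=0):
        self.rng = np.random.default_rng(seed)
        self.max_steps = max_steps

    def reset(self):
        # State is (position, velocity), started close to zero.
        self.state = self.rng.uniform(-0.05, 0.05, size=2)
        self.steps = 0
        return self.state.copy()

    def step(self, action):
        # Action 0 pushes left, action 1 pushes right.
        force = 1.0 if action == 1 else -1.0
        position, velocity = self.state
        velocity += 0.05 * force
        position += 0.05 * velocity
        self.state = np.array([position, velocity])
        self.steps += 1
        done = abs(position) > 1.0 or self.steps >= self.max_steps
        return self.state.copy(), 1.0, done  # +1 reward per surviving step


def run_episode(env, weights):
    """Play one episode with a fixed weight matrix; return the total reward."""
    state = env.reset()
    total_reward, done = 0.0, False
    while not done:
        # Multiply the state by the weights: one score per action, take the best.
        action = int(np.argmax(state @ weights))
        state, reward, done = env.step(action)
        total_reward += reward
    return total_reward


def random_search(env, n_states=2, n_actions=2, n_trials=50, seed=0):
    """Assumed optimiser: sample weight matrices at random, keep the best one."""
    rng = np.random.default_rng(seed)
    best_weights, best_reward, history = None, -np.inf, []
    for _ in range(n_trials):
        weights = rng.uniform(-1.0, 1.0, size=(n_states, n_actions))
        reward = run_episode(env, weights)
        if reward > best_reward:
            best_weights, best_reward = weights, reward
        history.append(best_reward)  # best reward seen so far
    return best_weights, best_reward, history


if __name__ == "__main__":
    weights, reward, _ = random_search(ToyBalanceEnv())
    print("best total reward:", reward)
```

For the real CartPole environment the weight matrix would presumably have shape (4, 2), one row per observation value and one column per action, with the toy environment swapped for the Gym one.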

Projects solved:

1 - CartPole - for information on the environment, see https://github.com/openai/gym/wiki/CartPole-v0
