Dynamic-Programming

Notebook 1

My attempt at Exercise 4.9 from Sutton and Barto's textbook, which asks for a solution to the gambler's problem using value iteration. All settings follow the assumptions stated in the exercise.
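For orientation, here is a minimal sketch of value iteration on the gambler's problem. It is not the notebook's code. The function name `gambler_value_iteration`, the defaults `p_h=0.4`, `goal=100`, and `theta=1e-9`, and the tie-breaking rule (smallest stake among near-equal actions) are all assumptions made for illustration. Reaching the goal is modelled by fixing the terminal value at 1, which is equivalent to a reward of +1 on reaching the goal in the undiscounted setting.

```python
def gambler_value_iteration(p_h=0.4, goal=100, theta=1e-9):
    """Return (V, policy) for the gambler's problem via in-place value iteration.

    p_h, goal and theta are illustrative defaults, not taken from the notebook.
    """
    # V[0] = 0 (ruin) and V[goal] = 1 (win) are fixed terminal values.
    V = [0.0] * (goal + 1)
    V[goal] = 1.0

    def q(s, a):
        # Expected value of staking `a` from capital `s` (undiscounted, zero step reward).
        return p_h * V[s + a] + (1.0 - p_h) * V[s - a]

    while True:
        delta = 0.0
        for s in range(1, goal):
            # Legal stakes: at least 1, never more than needed to reach the goal.
            best = max(q(s, a) for a in range(1, min(s, goal - s) + 1))
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < theta:
            break

    # Greedy policy; rounding suppresses floating-point noise between tied
    # actions, and ties are broken towards the smallest stake (an arbitrary choice).
    policy = [0] * (goal + 1)
    for s in range(1, goal):
        stakes = range(1, min(s, goal - s) + 1)
        policy[s] = max(stakes, key=lambda a: (round(q(s, a), 5), -a))
    return V, policy
```

Calling `gambler_value_iteration(p_h=0.25)` or `gambler_value_iteration(p_h=0.55)` would cover the two probabilities named in the exercise. Many stakes tie in this problem, so the policy returned depends on the tie-breaking rule.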

Notebook 2

The same gambler's problem, solved with the policy iteration method.
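A minimal sketch of policy iteration for the same problem is shown here, again as an illustration and not the notebook's code. The function name `gambler_policy_iteration`, the initial policy (always stake 1), the tolerances `eval_theta` and `improve_tol`, and the `max_rounds` guard are assumptions. The improvement step switches actions only when the gain exceeds `improve_tol`, which keeps it from flipping between tied stakes.

```python
def gambler_policy_iteration(p_h=0.4, goal=100, eval_theta=1e-12,
                             improve_tol=1e-9, max_rounds=1000):
    """Return (V, policy) for the gambler's problem via policy iteration.

    All defaults are illustrative assumptions, not taken from the notebook.
    """
    V = [0.0] * (goal + 1)
    V[goal] = 1.0                                # terminal value for reaching the goal
    policy = [0] + [1] * (goal - 1) + [0]        # start by always staking 1

    def q(s, a):
        # Expected value of staking `a` from capital `s` under the current V.
        return p_h * V[s + a] + (1.0 - p_h) * V[s - a]

    for _ in range(max_rounds):
        # Policy evaluation: iterate the Bellman expectation backup in place.
        while True:
            delta = 0.0
            for s in range(1, goal):
                v = q(s, policy[s])
                delta = max(delta, abs(v - V[s]))
                V[s] = v
            if delta < eval_theta:
                break

        # Policy improvement: switch only on a clear gain, to avoid cycling on ties.
        stable = True
        for s in range(1, goal):
            best_a, best_q = policy[s], q(s, policy[s])
            for a in range(1, min(s, goal - s) + 1):
                value = q(s, a)
                if value > best_q + improve_tol:
                    best_a, best_q = a, value
            if best_a != policy[s]:
                policy[s] = best_a
                stable = False
        if stable:
            break
    return V, policy
```

Because the task is undiscounted, evaluation relies on every policy with a stake of at least 1 eventually reaching 0 or the goal. With the cautious initial policy the first evaluation takes the most sweeps.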

