This repository contains DQN algorithm for pogema. Algorithm uses logger for training on previous experiments and two NNs: target net and policy net. Policy net is being training every training step and once in TARGET_UPDATE steps is being logged into target net for stable learning. File vis.py contains script for visualizing results into .svg file.
SuperCrabLover/DQN_For_Pogema
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|