Hello, we are going to test the DQN after training the network for about 5 million steps
Now I show you the output of the undergoing training process
You can see the steps, average reward from the console.
OK, here we go, the game starts
I maximize the play windows so you can see it clearly.
The code will play the game for several times, ie, when a game is terminated, it will starts a new one, but finally will end..
You can see how the agent performs, the score is at the bottom.
Enjoy it, if you are unpatient, just stop the video when you think it's enough. It not takes much time.
Thanks for watching ^_^
