Hello, welcome to A.I.
Playpen: human versus A.I.
I show how well the A.I. plays retro games,
and compare it to humans.
In this video, the goal is to see who wins
the game "Gradius".
I compare one human expert, an A.I. beginner
after short-term training, an A.I. intermediate
after medium-term training, and an A.I. expert
after long-term training.
I trained
the A.I. with reinforcement learning.
I used the proximal policy optimization algorithm.
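As a rough illustration of what proximal policy optimization (PPO) optimizes, here is a minimal sketch of its clipped surrogate objective in plain Python. The function name `ppo_clip_objective` and the `clip_eps=0.2` default are assumptions for illustration (0.2 is a commonly used value); the video does not state which settings were used, and a real toolkit computes this on batched tensors.

```python
import math

def ppo_clip_objective(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective over a batch (to be maximized).

    new_logp / old_logp: log-probabilities of the taken actions under the
    updated policy and the policy that collected the data.
    clip_eps is an assumed value, not a setting confirmed by the video.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(lp_new - lp_old)  # pi_new(a|s) / pi_old(a|s)
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

The clipping removes the incentive to move the new policy far from the old one in a single update: once the probability ratio leaves the range [1 - clip_eps, 1 + clip_eps] in the direction that would improve the objective, the gain is capped.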
The network was a standard convolutional
neural network with 3 internal layers.
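To make the "3 internal layers" concrete, the sketch below computes how an image shrinks as it passes through three convolutional layers. It assumes the layer sizes of the widely used network from the DQN work (32 filters of 8x8 with stride 4, 64 of 4x4 with stride 2, 64 of 3x3 with stride 1, no padding) and an 84x84 input; the video does not confirm these exact numbers, so treat them as an assumption.

```python
def conv_out(size, kernel, stride):
    """Output length of an unpadded convolution along one dimension."""
    return (size - kernel) // stride + 1

# (filters, kernel, stride) for each layer. Assumed values, not from the video.
LAYERS = [(32, 8, 4), (64, 4, 2), (64, 3, 1)]

def feature_shape(height, width, layers=LAYERS):
    """Return (height, width, channels) after applying all layers."""
    channels = None
    for filters, kernel, stride in layers:
        height = conv_out(height, kernel, stride)
        width = conv_out(width, kernel, stride)
        channels = filters
    return height, width, channels

print(feature_shape(84, 84))  # (7, 7, 64)
```

Under these assumptions, each frame is reduced to a 7x7 grid of 64 features, which would then be flattened and fed to the layers that choose an action.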
I used the OpenAI Baselines toolkit and the
OpenAI Retro environment.
The inputs were a stream of 2D pixel
images and rewards.
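The "stream of images and rewards" can be pictured as the usual agent-environment loop. The sketch below uses a tiny made-up environment, `ToyPixelEnv`, which is purely hypothetical and stands in for the emulator; it only mimics the classic Gym-style `reset()` / `step(action)` interface, and its reward rule is invented for illustration.

```python
import random

class ToyPixelEnv:
    """Hypothetical stand-in for an emulator environment, NOT the real game."""

    def __init__(self, height=4, width=4, episode_len=5, seed=0):
        self.height, self.width = height, width
        self.episode_len = episode_len
        self.rng = random.Random(seed)
        self.t = 0

    def _frame(self):
        # A 2D grid of pixel intensities in 0..255.
        return [[self.rng.randrange(256) for _ in range(self.width)]
                for _ in range(self.height)]

    def reset(self):
        self.t = 0
        return self._frame()

    def step(self, action):
        self.t += 1
        reward = 1.0 if action == 1 else 0.0  # invented reward rule
        done = self.t >= self.episode_len
        return self._frame(), reward, done, {}

def run_episode(env, policy):
    """Feed each frame to the policy and collect the rewards it earns."""
    obs = env.reset()
    total, steps, done = 0.0, 0, False
    while not done:
        action = policy(obs)                      # the agent sees pixels only
        obs, reward, done, _info = env.step(action)
        total += reward                           # the reward is its only feedback
        steps += 1
    return total, steps

print(run_episode(ToyPixelEnv(), lambda obs: 1))  # (5.0, 5)
```

The point of the sketch is that the agent never receives game-specific hints such as "this is a power-up"; everything has to be inferred from frames and reward signals.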
In this comparison, the human expert wins.
Sadly, the A.I. didn’t figure out the “power-up
system” of Gradius.
To power up properly, it must wait until it
has collected multiple items,
but it only obtained speed-ups by pressing
the B button randomly.
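The effect of pressing the button too early can be sketched with a simplified model of the power-up meter. The meter order below follows the usual Gradius layout, and the `play` function and its event names are hypothetical; the model ignores details such as upgrade limits, so it is an illustration rather than a faithful emulation.

```python
# Power-up meter slots from left to right (assumed standard Gradius order).
METER = ["SPEED UP", "MISSILE", "DOUBLE", "LASER", "OPTION", "?"]

def play(events):
    """Simulate a sequence of 'capsule' and 'press' events.

    Each capsule moves the highlight one slot to the right (wrapping around);
    a press buys the highlighted slot and clears the highlight.
    Returns the list of power-ups obtained.
    """
    highlighted = -1  # -1 means nothing is highlighted yet
    gained = []
    for event in events:
        if event == "capsule":
            highlighted = (highlighted + 1) % len(METER)
        elif event == "press" and highlighted >= 0:
            gained.append(METER[highlighted])
            highlighted = -1
    return gained

# Pressing right after every capsule only ever buys the first slot.
print(play(["capsule", "press"] * 3))    # ['SPEED UP', 'SPEED UP', 'SPEED UP']
# Waiting for four capsules before pressing reaches a weapon instead.
print(play(["capsule"] * 4 + ["press"]))  # ['LASER']
```

An agent that presses the button at random tends to spend each capsule as soon as it arrives, which in this model yields speed-ups and almost never a weapon.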
A sudden speed-up is harmful, because to the A.I.
it is like an unexpected change in the environment.
Thus, it had to fight with a “bare airship”
all the way through.
Moreover, the mid-boss (the volcanoes) was very hard
for the A.I. because the game environment suddenly
changed.
Thanks for watching this video.
