Hello, welcome to A.I.
Playpen, human versus A.I..
I show how well the AI plays retro games,
and compare them to humans.
In this video, the goal is to see who wins
the game
"Circus Charlie".
I will compare one human beginner, one human
expert, A.I. beginner, after short-term training,
A.I. intermediate, after medium-term training,
and A.I. expert, after long-term training.
I trained the A.I. with reinforcement learning.
I used the proximal policy optimization algorithm.
The network was the standard convolutional
neural network with 3 internal layers.
I used the OpenAI baselines toolkit, and the
OpenAI retro environment.
The data inputs were the stream of 2D pixel
images and rewards.
In this comparison, human expert wins.
Thanks for watching this video.
