Ping Pong

This is another example of Reinforcement learning. The player on right (the red one) is an intelligent player that is trying to determine the hidden strategy by playing multiple games with the machine player (the yellow one). When the ball crosses the midline, it changes color. The red player should hit or miss a ball. The left side switches allow you to configure (and change at runtime) the policy when the player should hit and when to miss. After playing a few rounds, the player on right determines this policy. You can notice the learning (and learned) stage by monitoring the checkboxes on the right side of the page.

View Ping Pong Demo

Error message