How should I explain Reinforcement Learning to a 4-year-old?
How should I explain Reinforcement Learning to a 4-year-old?
A robot takes a big step forward, then falls. The next time, it takes a smaller step and is able to hold its balance. The robot tries variations like this many times; eventually, it learns the right size of steps to take and walks steadily. It has succeeded.
What we see here is called reinforcement learning. It directly connects a robot's action with an outcome, without the robot having to learn a complex relationship between its action and results. The robot learns how to walk based on reward (staying on balance) and punishment (falling). This feedback is considered "reinforcement" for doing or not doing an action.
Reinforcement learning, in a simplistic definition, is learning best actions based on reward or punishment.