ELI5: Explain Like I'm 5

Temporal difference learning

Temporal difference (TD) learning is a type of machine learning that tells a computer how to learn from experience. In TD learning, the computer looks at its current state (like a game it is playing) and then compares it to what happened a few seconds ago. This allows it to learn from its mistakes and make better decisions in the future. This makes it possible for a computer to learn how to play a game without being explicitly told what to do.