Temporal difference learning

Temporal difference (TD) learning is a type of machine learning that tells a computer how to learn from experience. In TD learning, the computer looks at its current state (like a game it is playing) and then compares it to what happened a few seconds ago. This allows it to learn from its mistakes and make better decisions in the future. This makes it possible for a computer to learn how to play a game without being explicitly told what to do.

Related topics others have asked about:

PVLV, Q-learning, Rescorla–Wagner model, State–action–reward–state–action