Q-learning is a machine learning technique. It is a way for a computer to figure out how to take the best possible action in a given situation. To do this, it looks at different possible choices it could make, and assigns each of those choices a value based on how good or bad it would be. It then chooses the choice with the highest value.