Reinforcement learning (RL) is an interdisciplinary area of machine
learning and optimal control concerned with how an intelligent agent should
take actions in a dynamic environment in order to maximize a reward signal.
Reinforcement learning is one of the three basic machine learning paradigms,
alongside supervised learning and unsupervised learning.
Reinforcement learning differs from supervised learning in not needing
labelled input-output pairs to be presented, and in not needing sub-optimal
actions to be explicitly corrected.