Reinforcement learning is a type of machine learning where an agent learns to make decisions by taking actions in an environment to maximize cumulative rewards over time. This learning process involves exploration and exploitation, where the agent must balance trying new actions and using known ones that yield high rewards. It's deeply tied to concepts of decision-making biases and cognitive limitations, as well as applications in AI, especially when multiple agents interact with each other in complex environments.