Q-learning is a model-free reinforcement learning algorithm that helps an agent learn the value of actions in a given state to maximize the cumulative reward over time. It utilizes a Q-table to store the expected utility of taking a certain action in a specific state and updates this table based on the rewards received, making it adaptable to dynamic environments.
congrats on reading the definition of q-learning. now let's actually learn it.