Multi-armed bandits and reinforcement learning tackle the exploration-exploitation dilemma in decision-making. These techniques balance gathering new information with maximizing immediate rewards, which is crucial for optimizing outcomes in uncertain environments.

From epsilon-greedy to deep Q-networks, these methods power everything from A/B tests to game-playing AIs. They're key to making smart choices when you don't have all the facts, whether you're picking ads or training robots.

Exploration vs Exploitation Trade-off

Fundamental Concepts

  • Exploration-exploitation trade-off balances gathering new information and maximizing immediate rewards in sequential decision-making
  • Exploration gathers information about environment or possible actions for better future decisions
  • Exploitation maximizes immediate rewards based on current knowledge
  • Trade-off particularly relevant in scenarios with limited resources or time constraints (opportunity cost for each decision)
  • Mathematical formulations involve probability distributions and expected values of rewards for different actions
  • Applicable across various domains (machine learning, artificial intelligence, operations research, adaptive control systems)

Strategies and Considerations

  • Epsilon-greedy methods select best-known action with probability 1-ε and explore randomly with probability ε
  • Upper confidence bound algorithms maintain confidence intervals for expected reward of each arm
  • Thompson sampling uses a Bayesian approach with probability distributions over expected rewards
  • Optimal balance varies depending on problem structure, time horizon, and environmental uncertainty
  • Strategies aim to minimize regret (the difference between optimal and actual performance) over time, as formalized below
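
As a concrete reference (a standard definition, not specific to these notes), cumulative regret after T rounds compares the best arm's expected reward with the rewards of the arms actually chosen:

```latex
% Cumulative (pseudo-)regret after T rounds:
%   \mu^* = \max_j \mu_j is the expected reward of the best arm,
%   a_t is the arm chosen at round t.
R(T) = T\mu^{*} - \mathbb{E}\left[ \sum_{t=1}^{T} \mu_{a_t} \right]
```

A good strategy keeps R(T) growing sublinearly in T; UCB1, for example, achieves logarithmic regret under standard assumptions.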

Multi-armed Bandit Algorithms

Epsilon-Greedy Algorithm

  • Simple approach for multi-armed bandit problems
  • Maintains estimates of expected rewards for each arm
  • Updates estimates based on observed outcomes
  • Selects best-known action with probability 1-ε and explores randomly with probability ε
  • Higher ε values promote more exploration
  • Implementation involves tracking reward estimates and action counts
  • Example: In online advertising, ε-greedy with ε = 0.1 could select ads by exploiting the best-known performer 90% of the time and trying new options 10% of the time (see the sketch below)
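
A minimal sketch of ε-greedy for a bandit with numeric rewards, assuming incremental sample-mean estimates; the class and variable names are illustrative, not a fixed API:

```python
import random

class EpsilonGreedy:
    def __init__(self, n_arms, epsilon=0.1):
        self.epsilon = epsilon
        self.counts = [0] * n_arms       # number of pulls per arm
        self.values = [0.0] * n_arms     # running mean reward per arm

    def select_arm(self):
        # Explore with probability epsilon, otherwise exploit the best estimate
        if random.random() < self.epsilon:
            return random.randrange(len(self.values))
        return max(range(len(self.values)), key=lambda j: self.values[j])

    def update(self, arm, reward):
        # Incremental update of the sample mean for the pulled arm
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]
```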

Upper Confidence Bound (UCB) Algorithms

  • Use optimism in face of uncertainty to balance exploration and exploitation
  • Maintain confidence intervals for expected reward of each arm
  • Select arm with highest upper bound
  • UCB1 algorithm combines empirical mean reward with exploration bonus
  • UCB1 formula: \text{UCB1} = \bar{X}_j + \sqrt{\frac{2\ln n}{n_j}}
    • \bar{X}_j: empirical mean reward of arm j
    • n: total number of pulls
    • n_j: number of times arm j has been pulled
  • Automatically adjusts exploration based on uncertainty
  • Example: In clinical trials, UCB could guide selection of treatments, balancing known efficacy with potential of unexplored options
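
The UCB1 rule above translates directly into code. This sketch pulls every arm once before applying the confidence bound (a common initialization, assumed here for illustration):

```python
import math

class UCB1:
    def __init__(self, n_arms):
        self.counts = [0] * n_arms      # n_j: pulls per arm
        self.values = [0.0] * n_arms    # empirical mean reward per arm

    def select_arm(self):
        # Pull each arm once so every confidence bound is well defined
        for j, c in enumerate(self.counts):
            if c == 0:
                return j
        n = sum(self.counts)
        # Choose the arm with the highest upper confidence bound
        ucb = [self.values[j] + math.sqrt(2 * math.log(n) / self.counts[j])
               for j in range(len(self.counts))]
        return max(range(len(ucb)), key=lambda j: ucb[j])

    def update(self, arm, reward):
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]
```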

Thompson Sampling

  • Bayesian approach for multi-armed bandit problems
  • Maintains probability distribution over expected rewards of each arm
  • Samples from these distributions to make decisions
  • Updates posterior distributions based on observed rewards
  • Naturally balances exploration and exploitation
  • Effective in practice, often outperforming simpler methods
  • Example: In A/B testing for website design, Thompson sampling could dynamically allocate traffic to different versions based on performance uncertainty (see the sketch below)
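
A minimal Thompson sampling sketch for Bernoulli rewards (click / no click), maintaining a Beta posterior per arm; the uniform Beta(1, 1) priors are an assumption for illustration:

```python
import random

class ThompsonSamplingBernoulli:
    def __init__(self, n_arms):
        # Beta(alpha, beta) posterior per arm, starting from a uniform prior
        self.alpha = [1.0] * n_arms
        self.beta = [1.0] * n_arms

    def select_arm(self):
        # Sample a plausible mean reward for each arm, pick the best sample
        samples = [random.betavariate(self.alpha[j], self.beta[j])
                   for j in range(len(self.alpha))]
        return max(range(len(samples)), key=lambda j: samples[j])

    def update(self, arm, reward):
        # Bernoulli reward: 1 increments alpha (successes), 0 increments beta
        if reward:
            self.alpha[arm] += 1
        else:
            self.beta[arm] += 1
```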

Reinforcement Learning Techniques

Q-learning Fundamentals

  • Model-free reinforcement learning algorithm
  • Learns an action-value function (Q-function) representing expected cumulative reward
  • Based on the Markov decision process (MDP) framework
  • Q-learning update rule: Q(s_t, a_t) \leftarrow Q(s_t, a_t) + \alpha [r_t + \gamma \max_a Q(s_{t+1}, a) - Q(s_t, a_t)]
    • \alpha: learning rate
    • \gamma: discount factor for future rewards
  • Iteratively updates Q-values based on observed rewards and maximum Q-value of next state
  • Handles environments with discrete state and action spaces
  • Example: Q-learning applied to game playing (Tic-Tac-Toe) learns optimal moves through repeated play
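
A tabular Q-learning sketch implementing the update rule above. It assumes a hypothetical Gym-style environment whose reset() returns a hashable state and whose step(action) returns (next_state, reward, done); the hyperparameters are illustrative:

```python
import random
from collections import defaultdict

def q_learning(env, n_actions, episodes=1000, alpha=0.1, gamma=0.99, epsilon=0.1):
    # Q-table: maps (state, action) -> estimated return; unseen entries default to 0.0
    Q = defaultdict(float)

    for _ in range(episodes):
        state = env.reset()
        done = False
        while not done:
            # epsilon-greedy action selection over the current Q estimates
            if random.random() < epsilon:
                action = random.randrange(n_actions)
            else:
                action = max(range(n_actions), key=lambda a: Q[(state, a)])

            next_state, reward, done = env.step(action)

            # Q-learning update: move toward the bootstrapped target
            best_next = max(Q[(next_state, a)] for a in range(n_actions))
            target = reward + gamma * best_next * (not done)
            Q[(state, action)] += alpha * (target - Q[(state, action)])
            state = next_state
    return Q
```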

Policy Gradient Methods

  • Directly optimize the policy (mapping from states to actions)
  • Use gradient ascent on expected cumulative reward
  • Useful for continuous action spaces and high-dimensional state spaces
  • REINFORCE algorithm uses Monte Carlo sampling to estimate policy gradients
  • Policy gradient theorem forms basis for many algorithms: \nabla_\theta J(\theta) = E_{\pi_\theta}[\nabla_\theta \log \pi_\theta(a|s) Q^{\pi_\theta}(s,a)]
  • Can incorporate function approximation (neural networks) for complex state spaces
  • Example: Policy gradients applied to robot control tasks learn smooth, continuous actions for navigation or manipulation
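
A compact REINFORCE sketch in PyTorch for a discrete-action task, assuming the same Gym-style environment interface as above; the network size, learning rate, and episode count are illustrative choices:

```python
import torch
import torch.nn as nn

def reinforce(env, obs_dim, n_actions, episodes=500, gamma=0.99, lr=1e-2):
    # Simple softmax policy network: state -> action probabilities
    policy = nn.Sequential(nn.Linear(obs_dim, 64), nn.ReLU(),
                           nn.Linear(64, n_actions), nn.Softmax(dim=-1))
    optimizer = torch.optim.Adam(policy.parameters(), lr=lr)

    for _ in range(episodes):
        log_probs, rewards = [], []
        state, done = env.reset(), False
        while not done:
            probs = policy(torch.as_tensor(state, dtype=torch.float32))
            dist = torch.distributions.Categorical(probs)
            action = dist.sample()
            log_probs.append(dist.log_prob(action))
            state, reward, done = env.step(action.item())
            rewards.append(reward)

        # Monte Carlo returns G_t, computed backward through the episode
        returns, G = [], 0.0
        for r in reversed(rewards):
            G = r + gamma * G
            returns.insert(0, G)
        returns = torch.tensor(returns)

        # Policy gradient loss: -sum_t log pi(a_t|s_t) * G_t
        loss = -(torch.stack(log_probs) * returns).sum()
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    return policy
```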

Deep Reinforcement Learning

  • Combines RL algorithms with deep neural networks
  • Handles complex, high-dimensional state spaces (images, sensor data)
  • Deep Q-Network (DQN) uses convolutional neural networks for Q-function approximation
  • Actor-Critic methods separate policy (actor) and value function (critic) learning
  • Proximal Policy Optimization (PPO) improves stability of policy gradient methods
  • Addresses challenges of sparse rewards and long-term credit assignment
  • Example: DeepMind's AlphaGo used deep RL to master the game of Go, defeating world champions
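
A minimal DQN-style update step in PyTorch, showing Q-function approximation with a frozen target network; replay-buffer management and environment interaction are omitted, all names are illustrative, and a convolutional network would replace the fully connected layers for image inputs:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def make_q_net(obs_dim, n_actions):
    # Small fully connected Q-network mapping a state to one Q-value per action
    return nn.Sequential(nn.Linear(obs_dim, 128), nn.ReLU(),
                         nn.Linear(128, n_actions))

def dqn_update(q_net, target_net, optimizer, batch, gamma=0.99):
    # batch: tensors sampled from a replay buffer; actions is a LongTensor,
    # dones is a float tensor of 0/1 episode-termination flags
    states, actions, rewards, next_states, dones = batch

    # Q(s, a) for the actions that were actually taken
    q_values = q_net(states).gather(1, actions.unsqueeze(1)).squeeze(1)

    # Bootstrapped target from the frozen target network
    with torch.no_grad():
        next_q = target_net(next_states).max(dim=1).values
        targets = rewards + gamma * next_q * (1.0 - dones)

    loss = F.mse_loss(q_values, targets)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```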

Algorithm Performance Evaluation

Evaluation Metrics

  • Cumulative regret measures total loss compared to optimal strategy over time
  • Simple regret focuses on quality of final recommendation or decision
  • Best arm identification rate assesses ability to find optimal action
  • Average return and discounted cumulative reward evaluate overall performance in RL
  • Learning speed (sample efficiency) measures how quickly algorithms improve
  • Online performance evaluates adaptation during learning process
  • Offline performance assesses generalization after learning completes
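
A small helper for computing cumulative regret from a log of chosen arms when the true arm means are known (only possible in simulation, which is the assumption here):

```python
import numpy as np

def cumulative_regret(chosen_arms, true_means):
    """Cumulative (pseudo-)regret after each round of a simulated bandit run."""
    true_means = np.asarray(true_means)
    best_mean = true_means.max()
    # Per-round regret: gap between the best arm's mean and the chosen arm's mean
    per_round = best_mean - true_means[np.asarray(chosen_arms)]
    return np.cumsum(per_round)

# Example: arm 1 is best (mean 0.8); each pull of arm 0 adds 0.3 regret
# cumulative_regret([0, 1, 1, 0], [0.5, 0.8])  ->  array([0.3, 0.3, 0.3, 0.6])
```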

Real-world Applications and Challenges

  • A/B testing in online advertising and recommendation systems uses multi-armed bandits
  • Reinforcement learning applied in robotics, game playing, and resource management
  • Non-stationarity introduces time-varying rewards or state transitions
  • Partial observability limits access to complete state information
  • High-dimensional state spaces require efficient function approximation
  • Safety considerations crucial in physical systems (robotics, autonomous vehicles)
  • Scalability to large state/action spaces needed for practical applications
  • Example: Recommender systems use bandits to balance exploring new content and exploiting known user preferences

Robustness and Deployment Considerations

  • Algorithms must adapt to environmental changes in real-world scenarios
  • Evaluate performance across different initial conditions and random seeds
  • Consider computational requirements for real-time decision-making
  • Assess data efficiency to minimize costly interactions with environment
  • Balance exploration and exploitation in production systems
  • Implement safeguards against unexpected or adversarial inputs
  • Continuously monitor and update models in deployed systems
  • Example: Self-driving car algorithms must robustly handle diverse traffic scenarios and weather conditions

Key Terms to Review (18)

A/B Testing: A/B testing is a method of comparing two versions of a webpage, app, or other product to determine which one performs better. It helps in making data-driven decisions by randomly assigning users to different groups to evaluate the effectiveness of changes and optimize user experience.
Bandit feedback: Bandit feedback refers to a type of information obtained from decision-making processes where an agent must choose between multiple options, or 'arms', and receives feedback only from the chosen option. This concept is central to the multi-armed bandit problem, where an agent faces the challenge of balancing exploration (trying new options) and exploitation (choosing the best-known option) to maximize cumulative rewards. Understanding bandit feedback is crucial for developing strategies in environments with uncertainty, which is a common scenario in reinforcement learning.
Bayesian Optimization: Bayesian optimization is a strategy for optimizing objective functions that are expensive to evaluate, using probabilistic models to make informed decisions about where to sample next. It is particularly useful in scenarios where the function evaluations are time-consuming or costly, allowing for efficient exploration of the search space. By maintaining a posterior distribution over the function, it balances exploration and exploitation to find optimal solutions effectively.
Contextual bandit: A contextual bandit is a type of algorithm that combines elements of machine learning and reinforcement learning to make decisions based on context while also exploring different actions. It is designed to maximize rewards in scenarios where an agent must choose between multiple options, considering the specific context of each choice, which allows it to adaptively learn and optimize its strategy over time.
Cumulative Reward: Cumulative reward is a key concept in reinforcement learning that represents the total amount of reward an agent accumulates over time as it interacts with an environment. This metric is essential for evaluating the performance of an agent, as it reflects how well the agent is achieving its goals based on the rewards received from its actions. In scenarios like multi-armed bandits, maximizing cumulative reward is often the primary objective, guiding the agent to make better decisions based on past experiences.
Exploration vs. exploitation: Exploration vs. exploitation is a fundamental trade-off in decision-making and learning processes, particularly in reinforcement learning and multi-armed bandit scenarios. It involves the choice between exploring new options to discover potentially better rewards and exploiting known options to maximize immediate returns. This balance is crucial for achieving long-term success in environments where uncertainty exists, allowing agents to learn and adapt over time.
Finite multi-armed bandit: A finite multi-armed bandit is a decision-making problem where a player must choose between a finite number of options, or 'arms', each with an unknown probability distribution of rewards. The player aims to maximize their cumulative reward over time by balancing exploration (trying out different arms) and exploitation (choosing the best-known arm). This concept is foundational in reinforcement learning, as it highlights the trade-off between gathering information and making optimal choices based on that information.
Markov Decision Process: A Markov Decision Process (MDP) is a mathematical framework used to model decision-making situations where outcomes are partly random and partly under the control of a decision maker. MDPs are characterized by states, actions, transition probabilities, and rewards, making them crucial for understanding sequential decision-making problems in various fields, including reinforcement learning and multi-armed bandit scenarios. The Markov property ensures that the future state depends only on the current state and action, simplifying the analysis of complex systems.
Online advertising: Online advertising refers to the practice of promoting products or services through digital channels, utilizing the internet to reach potential customers. It encompasses various forms such as display ads, social media ads, and search engine marketing, allowing businesses to target specific audiences based on their online behavior. This method leverages data analytics and algorithms, making it a key player in modern marketing strategies.
Peter Auer: Peter Auer is a prominent figure in the field of machine learning known for his contributions to multi-armed bandit problems and reinforcement learning, most notably the UCB1 algorithm and its finite-time regret analysis. His work has significantly influenced algorithms designed to balance exploration and exploitation in uncertain environments, which is a key challenge in these areas of study.
Policy: In the context of reinforcement learning, a policy is a strategy or a set of guidelines that defines the actions an agent takes based on the current state of the environment. It serves as a mapping from states to actions, helping the agent determine the best course of action to maximize cumulative reward over time. Understanding how policies function is crucial in making decisions in uncertain environments, especially when it comes to optimizing long-term outcomes.
Regret: Regret is a measure of the difference between the reward obtained from a chosen action and the best possible reward that could have been achieved had a different action been taken. In the context of decision-making, especially in scenarios like multi-armed bandits and reinforcement learning, regret quantifies the performance loss due to suboptimal choices. It helps in evaluating algorithms by understanding how well they perform compared to an optimal strategy over time.
Reward signal: A reward signal is a feedback mechanism used in reinforcement learning that indicates the success or failure of an action taken by an agent within an environment. It provides quantitative information about the desirability of the state or action, guiding the learning process. By maximizing positive reward signals, agents can learn optimal behaviors over time, making it a fundamental concept in both reinforcement learning and multi-armed bandit problems.
Thompson Sampling: Thompson Sampling is a statistical method used for making decisions in uncertain environments, primarily focusing on maximizing rewards through exploration and exploitation strategies. This approach is particularly useful in scenarios where an agent must choose among multiple options (or arms) with unknown success rates, and it balances the trade-off between trying new options and leveraging known successful ones. The technique has strong connections to reinforcement learning and experimental design, as it provides a systematic way to learn from data while making informed decisions.
UCB Algorithm: The UCB (Upper Confidence Bound) algorithm is a strategy used in decision-making problems, particularly in the context of multi-armed bandits. It balances exploration and exploitation by selecting actions based on both the average reward and the uncertainty associated with that action, aiming to maximize long-term rewards. This approach is essential in reinforcement learning as it allows for optimal decision-making in uncertain environments.
Value Function: A value function is a key concept in reinforcement learning that measures the expected return or future reward that an agent can obtain from a particular state or action. It helps an agent make informed decisions by evaluating how good it is to be in a specific state or to take a specific action, guiding the learning process in environments where outcomes are uncertain.
Yee Whye Teh: Yee Whye Teh is a machine learning researcher known for work on Bayesian and probabilistic methods, including research related to reinforcement learning. This probabilistic perspective underpins Bayesian strategies for the exploration-exploitation trade-off, such as Thompson sampling, where posterior distributions over rewards guide which actions to try.
ε-greedy algorithm: The ε-greedy algorithm is a strategy used in reinforcement learning to balance exploration and exploitation. It selects the best-known action most of the time but also allows for random choices with a small probability ε, encouraging the agent to explore new actions that might yield better long-term rewards. This method effectively addresses the challenge of decision-making in uncertain environments by ensuring that the agent doesn't get stuck in suboptimal actions.