Reinforcement Learning (Sutton & Barto)June 8, 2026-core textbook notes on reinforcement learning fundamentals, from Markov decision processes through temporal-difference learning-imported from Notion and organized into chapter pagesChapters1.Chapter 1: Introduction2.Chapter 2: Multi-armed Bandits3.Chapter 3: Finite Markov Decision Processes (MDPs)4.Chapter 4: Dynamic Programming5.Chapter 5: Monte Carlo Methods6.Chapter 6: Temporal Difference Learning