Reinforcement Learning: An Introduction¶ 约 45 个字 预计阅读时间不到 1 分钟 Table of Contents¶ Chapter 1: Introduction Chapter 2: Multi-armed Bandits Chapter 3: Finite Markov Decision Processes Chapter 4: Dynamic Programming Chapter 5: Monte Carlo Methods Chapter 6: Temporal-Difference Learning Chapter 7: n-step Bootstrapping Chapter 8: Planning and Learning with Tabular Methods