Chapter 7: n-step Bootstrapping¶ 约 32 个字 预计阅读时间不到 1 分钟 7.1 n-step TD Prediction¶ 7.2 n-step Sarsa¶ 7.3 n-step Off-Policy Learning¶ 7.4 Per-decision Methods with Control Variates¶ 7.5 Off-policy Learning without Importance Sampling: The n-step Tree Backup Algorithm¶