跳转至

V1CeVersa's Notebook

Reinforcement Learning: An Introduction

正在初始化搜索引擎

V1CeVersaa/notebook

Home
Computer Science
System
AI
RL
Math
Varia

V1CeVersa's Notebook

V1CeVersaa/notebook

Home
Computer Science
System
AI
RL
RL
- Barto & Sutton
  Barto & Sutton
  - Chapter 1
  - Chapter 2
  - Chapter 3
  - Chapter 4
  - Chapter 5
  - Chapter 6
  - Chapter 7
  - Chapter 8
  - Chapter 9
  - Chapter 10
  - Chapter 11
  - Chapter 12
  - Chapter 13
  - Chapter 14
  - Chapter 15
  - Chapter 16
- Deep RL Intro
- UCB-CS285
- RL: Theory & Algorithms
- Gymnasium
- Stable Baselines3
Math
Varia

目录

Table of Contents

Reinforcement Learning: An Introduction¶

约 87 个字预计阅读时间不到 1 分钟

Table of Contents¶

Chapter 1: Introduction
Chapter 2: Multi-armed Bandits
Chapter 3: Finite Markov Decision Processes
Chapter 4: Dynamic Programming
Chapter 5: Monte Carlo Methods
Chapter 6: Temporal-Difference Learning
Chapter 7: n-step Bootstrapping
Chapter 8: Planning and Learning with Tabular Methods
Chapter 9: On-Policy Prediction with Approximation
Chapter 10: On-Policy Control with Approximation
Chapter 11: Off-Policy Methods with Approximation
Chapter 12: Eligibility Traces
Chapter 13: Policy Gradient Methods
Chapter 14: Psychology
Chapter 15: Neuroscience
Chapter 16: Applications and Case Studies
Chapter 17: Frontiers

Copyright © 2024 till now V1CeVersa

Made with Material for MkDocs