V1CeVersa's Notebook
Alignment with RL II
正在初始化搜索引擎
V1CeVersaa/notebook
Home
Computer Science
System
AI
RL
Math
Varia
V1CeVersa's Notebook
V1CeVersaa/notebook
Home
Computer Science
System
AI
AI
Framework & Toolkits
人工智能逻辑
机器学习基本知识
Machine Learning Theory
Deep Learning
Language Models
Language Models
Overview & Tokenization
Resource Accounting
Architectures & Hyperparameters
Mixture of Experts
GPUs
Kernels & Triton
Parallelism I
Parallelism II
Scaling Laws I
Inference
Scaling Laws II
Evaluation
Data I
Data II
Alignment with SFT/RLHF
Alignment with RL I
Alignment with RL II
Online Learning
Casual Inference
RL
Math
Varia
Alignment with RL II
约 0 个字
预计阅读时间不到 1 分钟
回到页面顶部