V1CeVersa's Notebook
Alignment with RL I



    Copyright © 2024 till now V1CeVersa