Fundamentals of Alignment: Reinforcement Learning

10.23, 10.30, 11.06

Instructor: Yaodong Yang

Topics Covered

  • Markov Decision Process
  • Bellman Equation
  • Actor-Critic Architecture
  • Policy Gradient
  • Proximal Policy Optimization