Policy and value iteration

From the course by National Research University Higher School of Economics
Practical Reinforcement Learning
31 ratings
Course 4 of 7 in the Specialization Advanced Machine Learning
From the lesson
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to efficiently compute the return your agent gets for a particular action, and how to pick the best actions based on that return.

Meet the Instructors

  • Pavel Shvechikov
    Researcher at HSE and Sberbank AI Lab
    HSE Faculty of Computer Science
  • Alexander Panin
    Lecturer
    HSE Faculty of Computer Science
