Planning in MDPs
Table of contents
- 1. Introduction
- 2. The Fundamental Theorem
- 3. Value Iteration and Our First Lower Bound
- 4. Policy Iteration
- 5. Online Planning - Part I.
- 6. Online Planning - Part II.
- 7. Function Approximation
- 8. Approximate Policy Iteration
- 9. Limits of query-efficient planning
- 10. Planning under $q^*$ realizability
- 11. Planning under $v^*$ realizability (TensorPlan I.)
- 12. TensorPlan and eluder sequences
- 13. From API to Politex
- 14. Politex
- 15. From policy search to policy gradients
- 16. Policy gradients