Course on Reinforcement Learning

Lecture 0: Introduction to the Course

Lectures

** Slides will be uploaded before each class (otherwise see slides from last year).

** The lecture notes are a bit outdated now, if you want to look at them refer to the material from last year.

News

• Changes in the schedule: Class on 10/11 is canceled and it is moved to 17/11 in the afternoon from 2:45pm to 6:45pm in Salle Condorcet, while the session of 17/11 is confirmed in the morning from 11am to 1pm as usual.
• First round of project proposals is available here. The proposals will be updated/integrated in the coming days.
• Changes in the schedule: On 1/12 we will have lecture, the class of 8/12 is canceled, while the last class on 15/12 will be the last TP of the course.

Lecture 1: A Bit of History

Lecture 2: MDP and Dynamic Programming

Lecture 3: Reinforcement Learning Algorithms

Lecture 4: The Multi-Armed Bandit Framework

Lecture 5: Approximate Dynamic Programming