This course covers the core algorithms and ideas at the intersection of deep learning and robotics, progressing from supervised imitation learning through model-based planning, policy gradient methods, value-based RL, goal-conditioned policies, reward learning, and practical deployment concerns. Slides and book chapters are drawn from three years of the graduate course at Université de Montréal.
Programming assignments for this course are available on GitHub (milarobotlearningcourse). Note that the assignments are a work in progress and will continue to be updated.
Lectures
Foundations
Behavior Cloning & Imitation
Planning & Model-Based RL
Policy Gradient Methods
Value-Based Methods
Sequence Models
Goal-Conditioned & Language-Guided RL
Reward Learning
Hierarchical RL
Multi-Agent RL & HRI
Offline RL
Transfer & Generalization
Continual & Autonomous Learning
Changelog
- 2026-06-14 — Initial website launch with lecture slides, videos, and readings.