Fundamentals of Reinforcement Learning Course Syllabus

Full curriculum breakdown — modules, lessons, estimated time, and outcomes.

Overview: This course offers a comprehensive introduction to reinforcement learning, focusing on core concepts and practical implementations. You'll explore sequential decision-making, Markov Decision Processes, value functions, and dynamic programming. The course blends theory with hands-on coding exercises, enabling you to build foundational skills in AI-driven decision systems. With approximately 15 hours of content, the course is designed for self-paced learning, ideal for those seeking a structured yet flexible path into reinforcement learning.

Module 1: Welcome to the Course!

Estimated time: 1 hour

Introduction to course structure and objectives
Meet the instructors from the University of Alberta
Course roadmap and learning outcomes
Navigating the Coursera platform

Module 2: An Introduction to Sequential Decision-Making

Estimated time: 3 hours

Understanding the exploration-exploitation trade-off
Implementing incremental algorithms for action-value estimation
Comparing exploration strategies
Foundations of reward-based learning

Module 3: Markov Decision Processes

Estimated time: 3 hours

Translating real-world problems into MDPs
Understanding goal-directed behavior via reward maximization
Differentiating episodic and continuing tasks
Formalizing decision-making problems using MDPs

Module 4: Value Functions & Bellman Equations

Estimated time: 3 hours

Defining policies and value functions
Understanding state-value and action-value functions
Deriving and applying Bellman equations
Role of Bellman equations in optimal decision-making

Module 5: Dynamic Programming

Estimated time: 3 hours

Computing value functions using dynamic programming
Implementing policy evaluation and improvement
Understanding Generalized Policy Iteration
Solving MDPs efficiently with iterative methods

Module 6: Final Project

Estimated time: 2 hours

Design and implement a reinforcement learning agent
Apply MDPs and dynamic programming to a simulated environment
Submit code and analysis for peer review

Prerequisites

Proficiency in Python programming
Basic understanding of probability and linear algebra
Familiarity with fundamental machine learning concepts

What You'll Be Able to Do After

Formulate decision-making problems using MDPs
Implement and compare exploration strategies
Apply value functions and Bellman equations to real-world scenarios
Use dynamic programming to solve reinforcement learning tasks
Build foundational knowledge for roles in AI, robotics, and data science

View Full Course Review

Fundamentals of Reinforcement Learning Course Syllabus

Module 1: Welcome to the Course!

Module 2: An Introduction to Sequential Decision-Making

Module 3: Markov Decision Processes

Module 4: Value Functions & Bellman Equations

Module 5: Dynamic Programming

Module 6: Final Project

Prerequisites

What You'll Be Able to Do After

Course AI Assistant Beta