Fundamentals of Reinforcement Learning Course Syllabus

Full curriculum breakdown — modules, lessons, estimated time, and outcomes.

Overview: This course offers a comprehensive introduction to reinforcement learning, focusing on core concepts and practical implementations. You'll explore sequential decision-making, Markov Decision Processes, value functions, and dynamic programming. The course blends theory with hands-on coding exercises, enabling you to build foundational skills in AI-driven decision systems. With approximately 15 hours of content, the course is designed for self-paced learning, ideal for those seeking a structured yet flexible path into reinforcement learning.

Module 1: Welcome to the Course!

Estimated time: 1 hour

  • Introduction to course structure and objectives
  • Meet the instructors from the University of Alberta
  • Course roadmap and learning outcomes
  • Navigating the Coursera platform

Module 2: An Introduction to Sequential Decision-Making

Estimated time: 3 hours

  • Understanding the exploration-exploitation trade-off
  • Implementing incremental algorithms for action-value estimation
  • Comparing exploration strategies
  • Foundations of reward-based learning

Module 3: Markov Decision Processes

Estimated time: 3 hours

  • Translating real-world problems into MDPs
  • Understanding goal-directed behavior via reward maximization
  • Differentiating episodic and continuing tasks
  • Formalizing decision-making problems using MDPs

Module 4: Value Functions & Bellman Equations

Estimated time: 3 hours

  • Defining policies and value functions
  • Understanding state-value and action-value functions
  • Deriving and applying Bellman equations
  • Role of Bellman equations in optimal decision-making

Module 5: Dynamic Programming

Estimated time: 3 hours

  • Computing value functions using dynamic programming
  • Implementing policy evaluation and improvement
  • Understanding Generalized Policy Iteration
  • Solving MDPs efficiently with iterative methods

Module 6: Final Project

Estimated time: 2 hours

  • Design and implement a reinforcement learning agent
  • Apply MDPs and dynamic programming to a simulated environment
  • Submit code and analysis for peer review

Prerequisites

  • Proficiency in Python programming
  • Basic understanding of probability and linear algebra
  • Familiarity with fundamental machine learning concepts

What You'll Be Able to Do After

  • Formulate decision-making problems using MDPs
  • Implement and compare exploration strategies
  • Apply value functions and Bellman equations to real-world scenarios
  • Use dynamic programming to solve reinforcement learning tasks
  • Build foundational knowledge for roles in AI, robotics, and data science
View Full Course Review

Course AI Assistant Beta

Hi! I can help you find the perfect online course. Ask me something like “best Python course for beginners” or “compare data science courses”.