Welcome to Module 8

Before we begin

Welcome to Module 8 of Deep Reinforcement Learning. This module is standalone within the track — read lessons in order unless you already know the prerequisites listed below.

Figure

Module 8 at a glance

Welcome, core lessons, quiz, then a hands-on project.

What this module covers

Lesson	Focus
Continuous action spaces	Core concepts + checkpoints
DDPG & deterministic policies	Core concepts + checkpoints
Soft actor–critic (SAC)	Core concepts + checkpoints
Sim-to-real & domain randomization	Core concepts + checkpoints
Robotics RL case studies	Core concepts + checkpoints
Quiz	Multiple-choice review with lesson links
Project	Portfolio-ready code you can extend

Prerequisites

Comfort with basic Python and NumPy.
For Module 1: no prior RL required. Later modules assume earlier modules in this track (or equivalent background).
Optional: the AI course Modules 1–3 help with gradients and neural networks before Module 4.

Figure

The RL loop

Agent observes state, chooses action, receives reward — repeat.

What to install before the project

Python 3.10+
pip install gymnasium numpy matplotlib
From Module 4 onward: pip install torch
From Module 6 onward: pip install stable-baselines3 (optional but recommended for PPO/SAC labs)

Ready?

Open the first technical lesson: Continuous action spaces