Welcome to Module 3 — deep learning for vision

Before we begin

Modules 1–2 were classical CV: hand-designed filters and geometry. Module 3 learns representations from data with convolutional neural networks.

You will train models in PyTorch, practice augmentation and transfer learning, and fine-tune an image classifier.

Figure

Module 3 at a glance

CNNs, augmentation, transfer learning, quiz, classifier project.

Install before the project

Python 3.10+
pip install torch torchvision matplotlib numpy
Optional GPU: CUDA-enabled PyTorch from pytorch.org

Connection to Modules 1–2

Classical idea	Deep learning version
Convolution filters	Learned conv kernels
Hand-crafted features	Learned feature maps
RANSAC geometry	Often still used with learned detectors

What's next

Lesson 1 — Convolutional networks for images