Welcome to Module 3 — deep learning for vision
Before we begin
Modules 1–2 were classical CV: hand-designed filters and geometry. Module 3 learns representations from data with convolutional neural networks.
You will train models in PyTorch, practice augmentation and transfer learning, and fine-tune an image classifier.
Figure
Module 3 at a glance
Install before the project
- Python 3.10+
pip install torch torchvision matplotlib numpy- Optional GPU: CUDA-enabled PyTorch from pytorch.org
Connection to Modules 1–2
| Classical idea | Deep learning version |
|---|---|
| Convolution filters | Learned conv kernels |
| Hand-crafted features | Learned feature maps |
| RANSAC geometry | Often still used with learned detectors |