← Back to curriculum

Module 1 — Imaging & digital images

Welcome — start here

Key CV vocabulary (pixel, radiance, convolution, ISP), how to read lessons, what Module 1 covers, and what to install before the project.

~30 min read + exercises

Welcome — start here

Before we begin

If you are new to computer vision, you are in the right place. This course is standalone — you do not need the AI track first, though some ideas overlap (images as numbers, CNNs later).

This page answers questions beginners actually have:

  • What is computer vision in plain language?
  • What vocabulary will I need (pixel, convolution, ISP, …)?
  • How should I read each lesson?
  • What does Module 1 cover, and what do I install before the project?

Key concepts (plain English)

Computer vision — Algorithms that take images or video as input and output useful information: labels, boxes, masks, depth, or motion.

Pixel — Smallest picture element; stores one or more numbers (brightness or R, G, B).

Image / tensor — A grid (or grid × channels) of numbers the algorithm reads. Files on disk are decoded into arrays first.

Convolution — Slide a small filter over the image, multiply-and-add at each position. Used for blur, edges, and (in deep learning) feature extraction.

ISP (image signal processor) — Pipeline inside a camera/phone that turns sensor data into the JPEG you see: demosaic, white balance, gamma, noise reduction.

Preprocessing — Resize, normalize, color convert — must match between training and deployment.

Figure

Computer vision at a glance

LightsceneSensorpixelsAlgorithmfeatures / CNNOutputlabels / masks
Light becomes pixels; algorithms extract structure; applications get labels, masks, or motion.

Figure

Module 1 at a glance

Module 1 — imaging pathWork top to bottom. Each lesson builds on the previous one.1Welcomeyou are here2Light & ISPsensors3Convolutionedges4Colorpreprocess5Quizself-check6Projectpipeline lab
Welcome, three core lessons, quiz, then a hands-on imaging pipeline lab.

What is this course?

Computer Vision Foundations walks from physics and pixels through geometry, deep learning, segmentation, video, and production deployment — with quizzes and projects in every module.

Module 1 in one sentence

You will understand what a digital image really is (light → sensor → numbers) and how convolution and color prepare data for later algorithms.

LessonTopic
1Light, sensors, ISP, noise
2Convolution, edges, Canny
3Color spaces & preprocessing
Quiz20 MCQs with review links
ProjectImaging pipeline lab in Python

Before you start

Required: comfort with basic algebra and reading small numeric tables.

Install before the project:

  • Python 3.10+
  • pip install numpy matplotlib opencv-python

Lessons 1–3 are reading + pencil exercises. The project is where you write code.


How to read each lesson

  1. Read Before we begin and Learning objectives.
  2. Do checkpoint questions before peeking at answers.
  3. Skim code only when the lesson says to — Module 1 is mostly intuition.
  4. After the quiz, use Topics to review links honestly.

Progress saves in this browser when you open a lesson.


What's next

Lesson 1 — Light, sensors, and the imaging pipeline