← Back to curriculum

Module 2 — Core machine learning

Module 2 quiz & review

25 interactive multiple-choice questions covering splits, metrics, overfitting, and supervised learning with lesson review links.

~55 min read + exercises

Module 2 quiz and review

Before we begin

Same approach as Module 1: guess first, click, read feedback, follow lesson links for misses.

  1. Read each question fully.
  2. Commit to an answer before clicking.
  3. Use wrong-answer links to re-read weak topics.
  4. Aim for at least 19 out of 25 before the project.

Click Try again to reset the whole quiz.


Multiple choice quiz

Interactive quiz

Pick one answer per question. Feedback appears immediately — take your time before clicking.

0 / 25 correct·0 answered
  1. Question 1 of 25

    What makes learning supervised?

    Answer options for question 1
  2. Question 2 of 25

    Which task is unsupervised?

    Answer options for question 2
  3. Question 3 of 25

    What is the main difference between regression and classification?

    Answer options for question 3
  4. Question 4 of 25

    Which problem is classification?

    Answer options for question 4
  5. Question 5 of 25

    Why does overfitting happen?

    Answer options for question 5
  6. Question 6 of 25

    Training accuracy is 99% but test accuracy is 52%. What fits best?

    Answer options for question 6
  7. Question 7 of 25

    Why do we hold out a test set and not tune the model on it?

    Answer options for question 7
  8. Question 8 of 25

    Why do we need a validation set (separate from test)?

    Answer options for question 8
  9. Question 9 of 25

    You preprocess the whole dataset (normalize using global mean/std) before splitting. What went wrong?

    Answer options for question 9
  10. Question 10 of 25

    When is accuracy a bad primary metric?

    Answer options for question 10
  11. Question 11 of 25

    In spam filtering, precision answers which question?

    Answer options for question 11
  12. Question 12 of 25

    In spam filtering, recall answers which question?

    Answer options for question 12
  13. Question 13 of 25

    A confusion matrix for binary spam detection has high false positives. What is the user-visible problem?

    Answer options for question 13
  14. Question 14 of 25

    Logistic regression is used for spam detection. Despite the name, it is mainly a ___ model.

    Answer options for question 14
  15. Question 15 of 25

    Which action reduces overfitting?

    Answer options for question 15
  16. Question 16 of 25

    K-means clustering is an example of:

    Answer options for question 16
  17. Question 17 of 25

    Predicting tomorrow’s temperature in °C is best framed as:

    Answer options for question 17
  18. Question 18 of 25

    You tune hyperparameters using the test set and report final accuracy on the same set. What went wrong?

    Answer options for question 18
  19. Question 19 of 25

    Underfitting usually means:

    Answer options for question 19
  20. Question 20 of 25

    When false negatives are costly (e.g. missing cancer on a scan), which metric do you prioritize raising?

    Answer options for question 20
  21. Question 21 of 25

    Precision answers which question for spam filtering?

    Answer options for question 21
  22. Question 22 of 25

    Why keep a validation set separate from test?

    Answer options for question 22
  23. Question 23 of 25

    Which is supervised learning?

    Answer options for question 23
  24. Question 24 of 25

    Dataset: 98% ham, 2% spam. A model always predicts ham. Accuracy is ~98%. Is this good?

    Answer options for question 24
  25. Question 25 of 25

    Early stopping during training helps because:

    Answer options for question 25

After the quiz

Scored 19/25 or higher? Move to the spam classifier project.

Scored lower? Re-read linked lessons, then Try again.

Checklist before coding:

  • I can explain supervised vs unsupervised with examples.
  • I know regression vs classification outputs.
  • I can describe overfitting from train vs test metrics.
  • I know why validation exists separately from test.
  • I can define precision and recall for spam filtering.

What's next

Project: spam classifier with API & MongoDB