← Back to curriculum

Module 4 — Deep learning architectures

Module 4 quiz & review

25 interactive questions on CNNs, RNNs, LSTM/GRU, and embeddings with lesson review links.

~55 min read + exercises

Module 4 quiz and review

Before we begin

Check CNNs, RNNs, LSTM/GRU, and embeddings before the sentiment project. Aim for at least 19 out of 25.


Multiple choice quiz

Interactive quiz

Pick one answer per question. Feedback appears immediately — take your time before clicking.

0 / 25 correct·0 answered
  1. Question 1 of 25

    Why do CNNs work better than plain MLPs on images?

    Answer options for question 1
  2. Question 2 of 25

    What does parameter sharing in a conv layer mean?

    Answer options for question 2
  3. Question 3 of 25

    Pooling (e.g. max pool 2×2) mainly helps by:

    Answer options for question 3
  4. Question 4 of 25

    An RNN is designed for:

    Answer options for question 4
  5. Question 5 of 25

    Why do plain RNNs struggle with long sequences?

    Answer options for question 5
  6. Question 6 of 25

    In sentiment analysis word-by-word, the RNN hidden state at step *t* summarizes:

    Answer options for question 6
  7. Question 7 of 25

    What problem does LSTM mainly address vs a vanilla RNN?

    Answer options for question 7
  8. Question 8 of 25

    GRU compared to LSTM:

    Answer options for question 8
  9. Question 9 of 25

    The LSTM forget gate decides:

    Answer options for question 9
  10. Question 10 of 25

    What is a word embedding?

    Answer options for question 10
  11. Question 11 of 25

    Why are embeddings important for GenAI?

    Answer options for question 11
  12. Question 12 of 25

    Pre-trained embeddings (e.g. GloVe, Word2Vec) vs training from scratch on 500 reviews:

    Answer options for question 12
  13. Question 13 of 25

    A 3×3 conv with stride 1 on a 32×32 map (no padding) produces:

    Answer options for question 13
  14. Question 14 of 25

    In sentiment analysis, averaging word embeddings for a review is:

    Answer options for question 14
  15. Question 15 of 25

    For product review sentiment, why might LSTM beat a bag-of-words model?

    Answer options for question 15
  16. Question 16 of 25

    Stride 2 convolution (vs stride 1) typically:

    Answer options for question 16
  17. Question 17 of 25

    Why share the same filter weights across all image locations in a conv layer?

    Answer options for question 17
  18. Question 18 of 25

    In an RNN, hₜ (hidden state at time t) mainly summarizes:

    Answer options for question 18
  19. Question 19 of 25

    The LSTM input gate decides:

    Answer options for question 19
  20. Question 20 of 25

    GRU compared to LSTM often:

    Answer options for question 20
  21. Question 21 of 25

    Words with similar meaning (e.g. king and queen) in a good embedding space:

    Answer options for question 21
  22. Question 22 of 25

    Backprop through time (BPTT) means:

    Answer options for question 22
  23. Question 23 of 25

    Padding (e.g. same padding) around a conv layer is used to:

    Answer options for question 23
  24. Question 24 of 25

    Word2Vec / GloVe pre-trained embeddings help downstream tasks because:

    Answer options for question 24
  25. Question 25 of 25

    For very long documents, plain RNNs struggle mainly because:

    Answer options for question 25

After the quiz

19/25 or higher? Start the sentiment project.

Checklist:

  • I can explain why CNNs suit images.
  • I know why vanilla RNNs forget long context.
  • I can describe LSTM gates in plain language.
  • I know what embeddings are and when to use pre-trained ones.

What's next

Project: sentiment analysis + dashboard