Module 4 quiz and review

Before we begin

Check CNNs, RNNs, LSTM/GRU, and embeddings before the sentiment project. Aim for at least 19 out of 25.

Multiple choice quiz

Interactive quiz

Pick one answer per question. Feedback appears immediately — take your time before clicking.

0 / 25 correct·0 answered

Question 1 of 25
Why do CNNs work better than plain MLPs on images?
Answer options for question 1
Question 2 of 25
What does parameter sharing in a conv layer mean?
Answer options for question 2
Question 3 of 25
Pooling (e.g. max pool 2×2) mainly helps by:
Answer options for question 3
Question 4 of 25
An RNN is designed for:
Answer options for question 4
Question 5 of 25
Why do plain RNNs struggle with long sequences?
Answer options for question 5
Question 6 of 25
In sentiment analysis word-by-word, the RNN hidden state at step *t* summarizes:
Answer options for question 6
Question 7 of 25
What problem does LSTM mainly address vs a vanilla RNN?
Answer options for question 7
Question 8 of 25
GRU compared to LSTM:
Answer options for question 8
Question 9 of 25
The LSTM forget gate decides:
Answer options for question 9
Question 10 of 25
What is a word embedding?
Answer options for question 10
Question 11 of 25
Why are embeddings important for GenAI?
Answer options for question 11
Question 12 of 25
Pre-trained embeddings (e.g. GloVe, Word2Vec) vs training from scratch on 500 reviews:
Answer options for question 12
Question 13 of 25
A 3×3 conv with stride 1 on a 32×32 map (no padding) produces:
Answer options for question 13
Question 14 of 25
In sentiment analysis, averaging word embeddings for a review is:
Answer options for question 14
Question 15 of 25
For product review sentiment, why might LSTM beat a bag-of-words model?
Answer options for question 15
Question 16 of 25
Stride 2 convolution (vs stride 1) typically:
Answer options for question 16
Question 17 of 25
Why share the same filter weights across all image locations in a conv layer?
Answer options for question 17
Question 18 of 25
In an RNN, hₜ (hidden state at time t) mainly summarizes:
Answer options for question 18
Question 19 of 25
The LSTM input gate decides:
Answer options for question 19
Question 20 of 25
GRU compared to LSTM often:
Answer options for question 20
Question 21 of 25
Words with similar meaning (e.g. king and queen) in a good embedding space:
Answer options for question 21
Question 22 of 25
Backprop through time (BPTT) means:
Answer options for question 22
Question 23 of 25
Padding (e.g. same padding) around a conv layer is used to:
Answer options for question 23
Question 24 of 25
Word2Vec / GloVe pre-trained embeddings help downstream tasks because:
Answer options for question 24
Question 25 of 25
For very long documents, plain RNNs struggle mainly because:
Answer options for question 25

After the quiz

19/25 or higher? Start the sentiment project.

Checklist:

I can explain why CNNs suit images.
I know why vanilla RNNs forget long context.
I can describe LSTM gates in plain language.
I know what embeddings are and when to use pre-trained ones.

What's next

Project: sentiment analysis + dashboard