← Back to curriculum

Module 1 — RL foundations & MDPs

Module 1 quiz & review

20 interactive MCQs with instant feedback and lesson links for topics you miss.

~45 min read + exercises

Module 1 quiz and review

Before we begin

Use this quiz to retrieve what you learned — guessing first, then reading feedback, beats passive re-reading.

Aim for at least 80% before starting the project. Use Try again to reset.


Multiple choice quiz

Interactive quiz

Pick one answer per question. Feedback appears immediately — take your time before clicking.

0 / 20 correct·0 answered
  1. Question 1 of 20

    In the RL loop, what does the agent send to the environment?

    Answer options for question 1
  2. Question 2 of 20

    The Markov property means:

    Answer options for question 2
  3. Question 3 of 20

    Discount factor γ = 0.9 primarily:

    Answer options for question 3
  4. Question 4 of 20

    Q^π(s,a) measures:

    Answer options for question 4
  5. Question 5 of 20

    Supervised learning differs from RL because RL:

    Answer options for question 5
  6. Question 6 of 20

    In the RL loop, what does the agent send to the environment? (variant 2)

    Answer options for question 6
  7. Question 7 of 20

    The Markov property means: (variant 2)

    Answer options for question 7
  8. Question 8 of 20

    Discount factor γ = 0.9 primarily: (variant 2)

    Answer options for question 8
  9. Question 9 of 20

    Q^π(s,a) measures: (variant 2)

    Answer options for question 9
  10. Question 10 of 20

    Supervised learning differs from RL because RL: (variant 2)

    Answer options for question 10
  11. Question 11 of 20

    In the RL loop, what does the agent send to the environment? (variant 3)

    Answer options for question 11
  12. Question 12 of 20

    The Markov property means: (variant 3)

    Answer options for question 12
  13. Question 13 of 20

    Discount factor γ = 0.9 primarily: (variant 3)

    Answer options for question 13
  14. Question 14 of 20

    Q^π(s,a) measures: (variant 3)

    Answer options for question 14
  15. Question 15 of 20

    Supervised learning differs from RL because RL: (variant 3)

    Answer options for question 15
  16. Question 16 of 20

    In the RL loop, what does the agent send to the environment? (variant 4)

    Answer options for question 16
  17. Question 17 of 20

    The Markov property means: (variant 4)

    Answer options for question 17
  18. Question 18 of 20

    Discount factor γ = 0.9 primarily: (variant 4)

    Answer options for question 18
  19. Question 19 of 20

    Q^π(s,a) measures: (variant 4)

    Answer options for question 19
  20. Question 20 of 20

    Supervised learning differs from RL because RL: (variant 4)

    Answer options for question 20

After the quiz

Passed? Continue to the project.

Need review? Use topic links in your results, re-read those lessons, then retake.