Question 1 of 20
Which statement best relates to Dynamic programming — policy & value iteration?
Question 2 of 20
Which statement best relates to Monte Carlo methods?
Question 3 of 20
Which statement best relates to Temporal-difference learning?
Question 4 of 20
Which statement best relates to Q-learning & SARSA?
Question 5 of 20
Which statement best relates to On-policy vs off-policy?
Question 6 of 20
Which statement best relates to Dynamic programming — policy & value iteration? (variant 2)
Question 7 of 20
Which statement best relates to Monte Carlo methods? (variant 2)
Question 8 of 20
Which statement best relates to Temporal-difference learning? (variant 2)
Question 9 of 20
Which statement best relates to Q-learning & SARSA? (variant 2)
Question 10 of 20
Which statement best relates to On-policy vs off-policy? (variant 2)
Question 11 of 20
Which statement best relates to Dynamic programming — policy & value iteration? (variant 3)
Question 12 of 20
Which statement best relates to Monte Carlo methods? (variant 3)
Question 13 of 20
Which statement best relates to Temporal-difference learning? (variant 3)
Question 14 of 20
Which statement best relates to Q-learning & SARSA? (variant 3)
Question 15 of 20
Which statement best relates to On-policy vs off-policy? (variant 3)
Question 16 of 20
Which statement best relates to Dynamic programming — policy & value iteration? (variant 4)
Question 17 of 20
Which statement best relates to Monte Carlo methods? (variant 4)
Question 18 of 20
Which statement best relates to Temporal-difference learning? (variant 4)
Question 19 of 20
Which statement best relates to Q-learning & SARSA? (variant 4)
Question 20 of 20
Which statement best relates to On-policy vs off-policy? (variant 4)