Question 1 of 20
In the RL loop, what does the agent send to the environment?
Question 2 of 20
The Markov property means:
Question 3 of 20
A discount factor of γ = 0.9 primarily:
Question 4 of 20
Q^π(s,a) measures:
Question 5 of 20
Supervised learning differs from RL because RL:
Question 6 of 20
In the RL loop, what does the agent send to the environment? (variant 2)
Question 7 of 20
The Markov property means: (variant 2)
Question 8 of 20
A discount factor of γ = 0.9 primarily: (variant 2)
Question 9 of 20
Q^π(s,a) measures: (variant 2)
Question 10 of 20
Supervised learning differs from RL because RL: (variant 2)
Question 11 of 20
In the RL loop, what does the agent send to the environment? (variant 3)
Question 12 of 20
The Markov property means: (variant 3)
Question 13 of 20
A discount factor of γ = 0.9 primarily: (variant 3)
Question 14 of 20
Q^π(s,a) measures: (variant 3)
Question 15 of 20
Supervised learning differs from RL because RL: (variant 3)
Question 16 of 20
In the RL loop, what does the agent send to the environment? (variant 4)
Question 17 of 20
The Markov property means: (variant 4)
Question 18 of 20
A discount factor of γ = 0.9 primarily: (variant 4)
Question 19 of 20
Q^π(s,a) measures: (variant 4)
Question 20 of 20
Supervised learning differs from RL because RL: (variant 4)