Difference between value iteration and policy iteration
Answer:
What is the difference between value iteration and policy iteration? ... As far as I understand, value iteration repeatedly applies the Bellman optimality equation to update the value function and then reads the optimal policy off the converged values, whereas policy iteration starts from an arbitrary (for example, randomly chosen) policy π and evaluates the expected return of that policy....
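To make the contrast concrete, here is a minimal sketch of both algorithms on a tiny MDP. Everything in it is an assumption made for illustration: the two-state, two-action transition table P, the reward table R, the discount GAMMA, and the function names are all hypothetical and are not taken from the question. Policy iteration is shown in its standard textbook form, where each evaluation step is followed by a greedy improvement step.

```python
import numpy as np

# Hypothetical 2-state, 2-action MDP (illustrative numbers only).
# P[a, s, s2] = probability of moving from state s to s2 under action a.
P = np.array([
    [[0.9, 0.1], [0.2, 0.8]],   # action 0
    [[0.1, 0.9], [0.7, 0.3]],   # action 1
])
# R[a, s] = expected immediate reward for taking action a in state s.
R = np.array([
    [1.0, 0.0],   # action 0
    [0.0, 2.0],   # action 1
])
GAMMA = 0.9  # assumed discount factor


def q_values(V):
    """Q[a, s] = R[a, s] + GAMMA * sum over s2 of P[a, s, s2] * V[s2]."""
    return R + GAMMA * (P @ V)


def value_iteration(tol=1e-10):
    """Apply the Bellman optimality backup to V until it stops changing."""
    V = np.zeros(P.shape[1])
    while True:
        V_new = q_values(V).max(axis=0)          # max over actions
        if np.max(np.abs(V_new - V)) < tol:
            # The policy is only extracted at the end, from the final values.
            return V_new, q_values(V_new).argmax(axis=0)
        V = V_new


def policy_iteration():
    """Alternate exact policy evaluation with greedy policy improvement."""
    n = P.shape[1]
    states = np.arange(n)
    policy = np.zeros(n, dtype=int)              # arbitrary starting policy
    while True:
        # Evaluation: solve the linear system (I - GAMMA * P_pi) V = R_pi.
        P_pi = P[policy, states]                 # row s is P[policy[s], s, :]
        R_pi = R[policy, states]
        V = np.linalg.solve(np.eye(n) - GAMMA * P_pi, R_pi)
        # Improvement: act greedily with respect to the evaluated values.
        new_policy = q_values(V).argmax(axis=0)
        if np.array_equal(new_policy, policy):
            return V, policy
        policy = new_policy


if __name__ == "__main__":
    print("value iteration: ", value_iteration())
    print("policy iteration:", policy_iteration())
```

In this sketch, value iteration never stores a policy while it runs and only extracts one from the final value function, while policy iteration always holds an explicit policy and stops as soon as the greedy improvement step leaves it unchanged. On a finite MDP both approaches should agree on the resulting policy.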