Which of the following statements are true. In RL problems
1.We assume that the agent determines the reward based on the current state and action
2.Our main aim is to get a net positive reward
3. At one time step we can perform only one action
4. Zero rewards may be possible
Answers
Answered by
1
Answer:
3. at one time we can perform only one action
Similar questions