Computer Science, asked by hicrk2412, 10 months ago

Which of the following statements are true. In RL problems
1.We assume that the agent determines the reward based on the current state and action
2.Our main aim is to get a net positive reward
3. At one time step we can perform only one action
4. Zero rewards may be possible

Answers

Answered by jenilparikh134
1

Answer:

3. at one time we can perform only one action

Similar questions