Computer Science, asked by murtazajamali059, 4 months ago

Consider the following environment of PacMan
For

For the environment design a Reinforcement Learning Agent (Pacman), the objective of the agent is to figure out the best actions the agent can take at any given state.
The rules of the game are as follows:
 Every move has a reward of -1
 Consuming a food pellet will have a reward of +10
 If pacman collides with a ghost, then the reward will be -500
 If the pacman has eaten all the food pellets without colliding with the ghosts, then the reward will be +500
 Assume a discount factor of 0.8
 The action noise is 0.3 (the consequences are the same as in the grid world example)
 The environment is static i.e. no ghosts are moving
 The actions for pacman are Up, Down, North and Right
 You can cross the walls
Use Q-Learning to figure out the best action at every state. Show your working for every iteration of Q-Learning.

Attachments:

Answers

Answered by rashi029

0

Answer:

Nice Game nothing

Previous Question

Next Question

Similar questions

Math, 2 months ago

9. If a man runs 3 km per day, how many km does he run in 2 weeks?

Science, 2 months ago

1. Where do we get energy from?

English, 2 months ago

shortcut key to copy the text...

Math, 4 months ago

find out the perimeter of the given picture...

Science, 4 months ago

Rohan took two identical sheets of paper. He crumbled the first sheet into a tight paper ball. He then dropped the crumbled paper ball and the paper sheet from equal heights above the ground at the same time. He found that the crumbled ball fell on the ground first) (a) Which force did Rohan apply to crumble the paper sheet? (b) Which force pulled tha paper ball towards the ground? (c) Why the...

English, 11 months ago

When did bhola grandpalet out a loud wail.

English, 11 months ago

Give the central idea of fable poem...

Math, 11 months ago

Using the prime factorization method find which of the following number are perfect square 576...