Pacman Qlearning Agent

This script is based on the python script written by Parsons.

What is it about?

The main idea of this script is the implement of Q-learning method in Pacman game in UC Berkeley CS188 Intro to AI.

The script contains a Q-learning agent player class of Pacman game. Note: this is not a solution for the coursework in CS188.

How does it work?

Download the reinforcement pack of Pacman game from here
Unzip the package and place mlLearningAgents.py inside the directory.
Run the training command in the directory. i.e. for 2000 runs of training and 10 runs of playing in a small grid, run:
```
   python pacman.py -p QLearnAgent -x 2000 -n 2010 -l smallGrid
```

Implementation of Q-learning

The structure of agent class is given: Function getAction() executes while the agent needs to take an action. Function final() is called at the end of every game.

The steps of implying Q-learning in a Pacman game:

initialize Q(s,a)
take a random action
update Q(s,a)
choose the action maximises Q or a random action according to Ɛ-greedy function
repeat step 3 and 4 until the game ends
update Q(s,a) where s is the last state before the end, a is the last action taken

For initiaization we need to add some attributes to the class:

a dictionary for storing Q(s,a)
a list records last state
a list records last action
a variable stores the score before last action

For storing Q(s,a), a handy structure called Counter can be found in util.py. We can make use of the function argMax() to return the action maximises Q.

Step 2-4 in Q-learning can be included in getAction() function. step 6 could be included in final() function.

getAction():

observe the reward of state
update Q(s,a)
Ɛ-greedy choose action
update attributes
return action

final():

observe reward
update Q(s,a)
reset attributes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Pacman Qlearning Agent

What is it about?

How does it work?

Implementation of Q-learning

Files

README.md

Latest commit

History

README.md

File metadata and controls

Pacman Qlearning Agent

What is it about?

How does it work?

Implementation of Q-learning