tildy-mdp

This is a fun project, inspired by talk of richard sutton - Tutorial: Introduction to Reinforcement Learning with Function Approximation

Play with this repo

python3 learn_mdp.py

About the project

Here the user is a reinforcement learning agent and he tries to find the optimal policy to gain maximum rewards. The environment has two states A and B. User can take 2 actions - 1,2 . Based on user's action in a state he gets positive or negative reward/feedback.

If you decide to play this game then following is the optimal policy

State	Action
A	2
B	1

This repository can be used for educational purposes. This repo can be used to explain the following concepts of Reinforcement Learning -

MDP
Exploration vs Exploitation Dilemma
Introduction to RL.

Feel free to improve this project. Pull Requests are welcome.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
learn_mdp.py		learn_mdp.py
true model of the world.jpeg		true model of the world.jpeg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tildy-mdp

Play with this repo

About the project

About

Releases

Packages

Languages

License

Arpanio/tildy-mdp

Folders and files

Latest commit

History

Repository files navigation

tildy-mdp

Play with this repo

About the project

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages