GitHub - lukau2357/bipedal-walker-td3: Project for the Artificial Intelligence course at the University of Ljubljana, Faculty of Computer and Information Science.

Project description

Implementation of the TD3 (Twin Delayed DDPG) algorithm for reinforcement learning (original publication link), which is particularly useful for problems with continuous state and action spaces.
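As a rough orientation, the three ingredients that distinguish TD3 from DDPG (target policy smoothing, clipped double Q-learning, and delayed policy updates) can be sketched for a single transition as follows. This is a minimal illustration and not the code in `src`: the function names, the callables `actor_target`, `critic1_target`, `critic2_target`, and the default hyperparameters (`gamma`, `policy_noise`, `noise_clip`, `policy_delay`) are assumptions chosen for the example.

```python
import random


def clip(x, low, high):
    """Clamp x to the closed interval [low, high]."""
    return max(low, min(high, x))


def td3_target(reward, done, next_state, actor_target, critic1_target,
               critic2_target, gamma=0.99, policy_noise=0.2, noise_clip=0.5,
               max_action=1.0, rng=None):
    """Critic regression target for one transition (hypothetical helper)."""
    rng = rng or random.Random()

    # Target policy smoothing: add clipped Gaussian noise to the target action.
    smoothed_action = []
    for a in actor_target(next_state):
        noise = clip(rng.gauss(0.0, policy_noise), -noise_clip, noise_clip)
        smoothed_action.append(clip(a + noise, -max_action, max_action))

    # Clipped double Q-learning: take the smaller of the two target critics.
    q1 = critic1_target(next_state, smoothed_action)
    q2 = critic2_target(next_state, smoothed_action)
    target_q = min(q1, q2)

    # Do not bootstrap from a terminal state.
    return reward + gamma * (0.0 if done else 1.0) * target_q


def should_update_actor(step, policy_delay=2):
    """Delayed policy updates: refresh the actor every policy_delay critic steps."""
    return step % policy_delay == 0
```

Both critics are regressed toward the same target, while the actor and the target networks are updated only on steps where `should_update_actor` returns `True`.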

The algorithm was tested on the BipedalWalker-v3 environment. To evaluate the variability of the algorithm, we trained 15 different agents on a high-performance GPU with CUDA for 550 episodes. We recorded the reward obtained by each agent, with the following results:
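The sketch below is not the project's evaluation code and contains no measured results. It only illustrates, under assumptions, how per-agent reward logs from such an experiment might be summarized as a per-episode mean and standard deviation across agents. The function name `summarize_runs` and the input layout (one list of episode rewards per agent) are hypothetical.

```python
from statistics import mean, pstdev


def summarize_runs(rewards_per_agent):
    """Return per-episode (means, stds) across agents.

    rewards_per_agent: one list of episode rewards per agent, all of the
    same length (assumed layout, e.g. 15 lists of 550 values).
    """
    lengths = {len(run) for run in rewards_per_agent}
    if len(lengths) != 1:
        raise ValueError("all agents must have the same number of episodes")

    # Transpose so that each tuple holds one episode's rewards for all agents.
    per_episode = list(zip(*rewards_per_agent))
    means = [mean(ep) for ep in per_episode]
    stds = [pstdev(ep) for ep in per_episode]  # population standard deviation
    return means, stds
```

Plotting the mean curve with a band of one standard deviation around it is a common way to show how much independently trained agents differ.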

The learning process can be observed in the following video:

Technical details about the algorithm can be found in the accompanying report.

src
.gitignore
LICENSE
README.md
presentation.pdf
report.pdf
requirements.txt
