Hindsight-Experience-Replay

This repository provides a PyTorch implementation of Hindsight Experience Replay (HER) applied to the Deep Q-Network (DQN) and Deep Deterministic Policy Gradient (DDPG) algorithms.

Link to the paper: https://arxiv.org/pdf/1707.01495.pdf

Authors: Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Josh Tobin, Pieter Abbeel, Wojciech Zaremba
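
To make the idea concrete, here is a minimal sketch of hindsight relabeling using the "final" strategy described in the paper, on a bit-flipping task with sparse rewards. It is not taken from this repository's code: the names Transition, reward_fn, and relabel_with_final_goal are hypothetical, and the reward convention (0 on reaching the goal, -1 otherwise) is assumed from the paper's bit-flipping example.

```python
from typing import List, NamedTuple, Tuple

State = Tuple[int, ...]


class Transition(NamedTuple):
    state: State
    action: int
    reward: float
    next_state: State
    goal: State
    done: bool


def reward_fn(next_state: State, goal: State) -> float:
    """Sparse reward: 0 when the goal is reached, -1 otherwise (assumed)."""
    return 0.0 if next_state == goal else -1.0


def relabel_with_final_goal(episode: List[Transition]) -> List[Transition]:
    """Return hindsight copies of an episode, pretending that the state
    actually reached at the end was the goal all along ("final" strategy)."""
    achieved = episode[-1].next_state
    relabeled = []
    for t in episode:
        r = reward_fn(t.next_state, achieved)
        relabeled.append(t._replace(goal=achieved, reward=r, done=(r == 0.0)))
    return relabeled


# A failed 2-step episode on 3 bits: the goal (1, 1, 1) is never reached.
goal = (1, 1, 1)
s0, s1, s2 = (0, 0, 0), (1, 0, 0), (1, 1, 0)
episode = [
    Transition(s0, 0, reward_fn(s1, goal), s1, goal, False),
    Transition(s1, 1, reward_fn(s2, goal), s2, goal, False),
]

# Both the original and the relabeled transitions go into the replay buffer.
replay_buffer = episode + relabel_with_final_goal(episode)
```

Every original transition carries a reward of -1, but the relabeled copy of the last step is rewarded because it reaches the substituted goal. This gives the agent a learning signal even from an unsuccessful episode.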

Training

To train a model, run the main file for the configuration you want:

DQN With HER -> HERmain.py

DDPG With HER -> DDPG_HER_main.py

DQN Without HER -> main.py

You can set hyperparameters such as learning_rate, the discount factor (gamma), and epsilon, among others, when initializing the agent variable in the files listed above.
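
As an illustration of what these hyperparameters control, the sketch below groups them in a small config object and shows epsilon-greedy action selection with a decaying epsilon. The class AgentConfig, its field names and default values, the decay schedule, and select_action are hypothetical placeholders, not the constructor arguments used in this repository; check the agent initialization in the files above for the actual names.

```python
import random
from dataclasses import dataclass
from typing import Sequence


@dataclass
class AgentConfig:
    # All names and values here are illustrative assumptions.
    learning_rate: float = 1e-3
    gamma: float = 0.99        # discount factor
    epsilon: float = 1.0       # initial exploration rate
    epsilon_min: float = 0.05
    epsilon_decay: float = 0.995

    def decay_epsilon(self) -> None:
        """Multiplicatively shrink epsilon, never going below epsilon_min."""
        self.epsilon = max(self.epsilon_min, self.epsilon * self.epsilon_decay)


def select_action(q_values: Sequence[float], cfg: AgentConfig,
                  rng: random.Random) -> int:
    """Epsilon-greedy: random action with probability epsilon, else greedy."""
    if rng.random() < cfg.epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


cfg = AgentConfig(epsilon=0.0)  # no exploration: always pick the greedy action
greedy = select_action([0.1, 0.7, 0.2], cfg, random.Random(0))
```

A higher epsilon means more random exploration, while gamma closer to 1 makes the agent weigh future rewards more heavily.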

Running the pre-trained model

To run a pre-trained model, run the relevant file from the Training section with the load_checkpoint variable set to True. This loads the saved model parameters and outputs the results. Update the paths to match the location of your saved results.
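
A minimal sketch of what such a checkpoint round trip typically looks like in PyTorch, assuming the model is an nn.Module saved through its state_dict. The helper names save_checkpoint and load_checkpoint_into, the file name, and the tiny stand-in network are assumptions for illustration; the repository's own saving and loading code and paths may differ.

```python
import os
import tempfile

import torch
import torch.nn as nn


def save_checkpoint(model: nn.Module, path: str) -> None:
    """Write the model parameters to disk."""
    torch.save(model.state_dict(), path)


def load_checkpoint_into(model: nn.Module, path: str) -> None:
    """Load saved parameters into an already constructed model."""
    state = torch.load(path, map_location="cpu")
    model.load_state_dict(state)
    model.eval()  # switch to evaluation mode for inference runs


load_checkpoint = True  # mirrors the flag described above

ckpt_dir = tempfile.mkdtemp()
ckpt_path = os.path.join(ckpt_dir, "q_network.pt")  # hypothetical path

trained = nn.Linear(4, 2)   # stand-in for a trained network
save_checkpoint(trained, ckpt_path)

restored = nn.Linear(4, 2)  # fresh network with the same architecture
if load_checkpoint:
    load_checkpoint_into(restored, ckpt_path)
```

The network that loads the checkpoint must have the same architecture as the one that saved it, otherwise load_state_dict raises an error about mismatched keys or shapes.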

Results

[Figures: training results plotted with averaging, and without averaging (the latter contains spikes)]

