This repository contains a tensorflow HER implementation and a bit flipping environment as described in OpenAI's paper
The implementation includes :
- In
Hindsight Experience Replay.ipynb
:- A DQN and a DDQN agent (which also work on other traditional gym environments)
- A bit flipping environment
- Pre-trained models for 30-bits, 40-bits and 50-bits flipping environments
- In
ChaseEnv_DDPG.ipynb
:- A DDPG agent
- A
ChaseEnv
environment, where a chaser is initialized at a random position in a 2d plane and has to reach a goal in another random position within a certain threshold.
- 100% success rate for 30 and 40-bits environments
- 95% success rate for 50-bits environment (average on 100 tests)
- 90% success rate for size=5 ChaseEnv (average on 100 tests)
Check the "Training" cell to adjust training parameters and enable/disable HER.
- Optimize the way to concatenate transitions
- Parallelize training
- Train on bit length > 30
- Implement DDPG
Here is a link to a robot arm reach environment created in Unity, trained with ML-Agents.
This environment is trained using DDPG with and without HER, and the comparison is plotted. DDPG+HER performs better.