A codebase for continuous action spaces Reinforcement Learning algorithms
Currently contains these four algorithms:
- Deep Deterministic Policy Gradient (DDPG)
- Twin Delayed Deep Deterministic Policy Gradient (TD3)
- Soft Actor-Critic (SAC)
- Soft Q-Learning (SQL)
If anyone is interested in contributing, contact me please