Skip to content

Latest commit

 

History

History
38 lines (27 loc) · 1.6 KB

README.md

File metadata and controls

38 lines (27 loc) · 1.6 KB

Reinforcement Learning Baseline

We tend to implement stable versions of popular deep reinforcement learning algorithms and test them in various problems.

Algorithms:

Cross-Entropy Method (CEM)

Deep Q-Network (DQN)

Double Deep Q-Network (DDQN)

Deep Deterministic Policy Gradient (DDPG)

Normalized Adavtage Functions (NAF)

Asynchronous Advantage Actor-Critic (A3C)

Continuous Value Iteration (CVI)

Proximal Policy Optimization (PPO)

Soft Actor-Critic (SAC)