Attention-based Partially Decoupled Actor-Critic (APDAC)

This repository contains the code for the following paper presented at the Deep RL Workshop, NeurIPS 2021:
Attention-based Partial Decoupling of Policy and Value for Generalization in Reinforcement Learning.

Citation

If you use this code, please cite our paper:

Nafi, N.M., Glasscock, C. and Hsu, W. (2021). Attention-based Partial Decoupling of Policy and Value for Generalization in Reinforcement Learning. In Deep Reinforcement Learning Workshop, NeurIPS 2021.

Our code is largely based on this implementation and the corresponding paper is available here. Their implementation used an open sourced PyTorch implementation of PPO.

Dependencies

Run the following to create the environment and install the required dependencies:

conda create -n apdac python=3.7
conda activate apdac

cd apdac
pip install -r requirements.txt

pip install procgen

pip install protobuf==3.20.0

git clone https://github.com/openai/baselines.git
cd baselines 
python setup.py install

Instructions

To Train APDAC on CoinRun

python train.py --env_name coinrun --algo apdac

To Train IDAAC on CoinRun

python train.py --env_name coinrun --algo idaac

To Train PPO on CoinRun

python train.py --env_name coinrun --algo ppo --ppo_epoch 3

APDAC uses the same set of hyperparameters for all environments. Please refer to the paper for the details and the experimental results. APDAC significantly outperforms the PPO baseline and achieves comparable performance with respect to the recent state-of-the-art method IDAAC on the challenging RL generalization benchmark Procgen. Thus, APDAC demonstrates similar generalization benefits of a fully decoupled approach while reducing the overall parameters and computational cost.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
ppo_idaac_apdac		ppo_idaac_apdac
README.md		README.md
hyperparams.py		hyperparams.py
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Attention-based Partially Decoupled Actor-Critic (APDAC)

Citation

Dependencies

Instructions

To Train APDAC on CoinRun

To Train IDAAC on CoinRun

To Train PPO on CoinRun

About

Releases

Packages

Languages

NasikNafi/apdac

Folders and files

Latest commit

History

Repository files navigation

Attention-based Partially Decoupled Actor-Critic (APDAC)

Citation

Dependencies

Instructions

To Train APDAC on CoinRun

To Train IDAAC on CoinRun

To Train PPO on CoinRun

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages