A PyTorch implementation of Wolpertinger training with DDPG for a cache environment.
The code supports multi-GPU, single-GPU, and CPU-only training.
- python 3.6.8
- torch 1.1.0
- gym 0.14.0
- pyflann
- FLANN (Muja & Lowe, 2014) is a library of approximate nearest-neighbor methods that allows lookup complexity logarithmic in the number of actions. However, the Python binding of FLANN (pyflann) was written for Python 2 and is no longer maintained. Please refer to pyflann for a package compatible with Python 3; just download it and place it in your (virtual) environment.
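To illustrate what the nearest-neighbor lookup buys here, the sketch below shows the core of Wolpertinger action selection: the actor emits a continuous proto-action, the k nearest discrete actions are retrieved, and the critic picks the best of those candidates. Plain NumPy brute force stands in for FLANN only to keep the sketch self-contained; the function and argument names are illustrative, not this repo's API.

```python
import numpy as np

def wolpertinger_action(proto_action, actions, q_values_fn, k=5):
    """Sketch of Wolpertinger action selection (names are illustrative).

    proto_action: 1-D continuous action produced by the actor network.
    actions:      (N, d) array holding the embedding of every discrete action.
    q_values_fn:  callable scoring a (k, d) batch of candidate actions.
    In the actual code the k-NN step is done with FLANN for logarithmic
    lookup; brute-force distances are used here for self-containment.
    """
    dists = np.linalg.norm(actions - proto_action, axis=1)
    knn_idx = np.argpartition(dists, k)[:k]   # indices of the k nearest actions
    candidates = actions[knn_idx]
    # Refine the k-NN candidates with the critic: take the highest-Q action.
    return knn_idx[np.argmax(q_values_fn(candidates))]

# Toy usage: 10 scalar actions 0..9, proto-action 3.2, critic preferring 5.
actions = np.arange(10, dtype=float).reshape(-1, 1)
best = wolpertinger_action(np.array([3.2]), actions,
                           lambda c: -np.abs(c[:, 0] - 5.0), k=5)
```

The refinement step is what distinguishes Wolpertinger from a plain nearest-neighbor projection: among the 5 actions closest to 3.2 (1 through 5), the critic selects 5 rather than the single nearest action 3.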
- To use CPU only:
$ python main.py --gpu-ids -1
- To use single-GPU only:
$ python main.py --gpu-ids 0 --gpu-nums 1
- To use multi-GPU (e.g., use GPU-0 and GPU-1):
$ python main.py --gpu-ids 0 1 --gpu-nums 2
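The commands above suggest how the device flags could be parsed with argparse; a minimal sketch follows. The flag names match the commands, but the defaults and the `use_cuda` helper are assumptions, not the definitions in arg_parser.py.

```python
import argparse

# Sketch of the device flags implied by the commands above (the real
# definitions live in arg_parser.py; defaults here are assumptions).
parser = argparse.ArgumentParser()
parser.add_argument('--gpu-ids', type=int, nargs='+', default=[-1],
                    help='GPU ids to use; -1 means CPU only')
parser.add_argument('--gpu-nums', type=int, default=1,
                    help='number of GPUs to use')

# Example: the multi-GPU invocation from above.
args = parser.parse_args(['--gpu-ids', '0', '1', '--gpu-nums', '2'])
use_cuda = args.gpu_ids[0] >= 0   # -1 selects the CPU-only code path
```

`nargs='+'` is what lets a single flag accept either one id (`--gpu-ids 0`) or several (`--gpu-ids 0 1`).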
- You can set your experiment parameters in arg_parser.py
- train_test.py is used for the baseline experiment.
- train_test_window.py is used for the window experiment.
- Original paper of Wolpertinger training with DDPG: Deep Reinforcement Learning in Large Discrete Action Spaces (Dulac-Arnold et al., Google DeepMind).
- I used and modified parts of the code in https://github.com/ghliu/pytorch-ddpg, under the Apache License 2.0.
- I used and modified parts of the code in https://github.com/jimkon/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces, under the MIT License.