A Study of Partial Observability in
Multi-Agent Reinforcement Learning



Paper   •   Contact Us


Simple-Spread task: agents trained under different partial-observation settings achieve comparable, near-optimal performance. From left to right: each agent can observe 2, 4, 6, or 8 nearby agents.
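To make the observation settings above concrete, here is a minimal sketch of restricting an agent's view to its k closest neighbours. It is an illustration only, not the code used in this repository: the function name `k_nearest_observation`, the 2D tuple representation of positions, and the choice of Euclidean distance are all assumptions.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def k_nearest_observation(positions: Sequence[Point], agent: int, k: int) -> List[Point]:
    """Hypothetical helper: relative positions of the k agents closest to `agent`.

    Assumes 2D positions and Euclidean distance. If fewer than k other
    agents exist, all of them are returned.
    """
    ax, ay = positions[agent]
    # (distance, index, offset) for every agent except the observer itself
    others = [
        (math.hypot(x - ax, y - ay), i, (x - ax, y - ay))
        for i, (x, y) in enumerate(positions)
        if i != agent
    ]
    # Sort by distance; ties are broken by agent index so the result is deterministic
    others.sort(key=lambda item: (item[0], item[1]))
    return [offset for _, _, offset in others[:k]]


positions = [(0.0, 0.0), (1.0, 0.0), (3.0, 0.0), (0.0, 2.0), (5.0, 5.0)]
obs = k_nearest_observation(positions, agent=0, k=2)
```

With k equal to the number of other agents, this reduces to full observability; smaller k gives the partially observable settings shown in the figure.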

Installation

$ conda env create -f environment.yml

Train the agents

$ cd scripts
$ ./run_mpe_batch.sh

Results

The pretrained simple-spread models can be found in results/MPE/simple_spread/ramppo/models. To visualize the agents, run the rendering script:

$ cd scripts
$ ./render_mpe.sh

Citation

If you use this code in any academic work, please cite the following publication:

@misc{wenshuai2023less,
    title={Less Is More: Robust Robot Learning via Partially Observable Multi-Agent Reinforcement Learning}, 
    author={Wenshuai Zhao and Eetu Rantala and Joni Pajarinen and Jorge Peña Queralta},
    year={2023},
    eprint={},
    archivePrefix={arXiv},
    primaryClass={cs.RO}
}