Actor Critic with PPO

For an intuitive guide to the mechanics of actor-critic methods, check out the accompanying comic.

This notebook is designed for readability and exploration rather than production, and it uses a single GPU. For an industrial-strength PPO in PyTorch, check out ikostrikov's implementation. For the 'definitive' implementation of PPO, check out OpenAI Baselines (TensorFlow). For outstanding resources on RL, check out OpenAI's Spinning Up.

The notebook reproduces results from OpenAI's procedurally generated environments and the corresponding paper (Cobbe 2019). All hyperparameters are taken directly from the paper. Everything is built from scratch unless otherwise noted, to gain intuition.
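For readers new to PPO, here is a minimal sketch of the clipped surrogate objective that gives the method its name. It is written in plain Python rather than PyTorch so it runs anywhere, and it is not taken from the notebook: the function name `ppo_clip_loss`, its arguments, and the default `clip_eps=0.2` are illustrative assumptions.

```python
import math

def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Clipped PPO policy loss averaged over a batch (hypothetical helper).

    new_log_probs: log pi_new(a|s) for each sampled action
    old_log_probs: log pi_old(a|s) recorded when the data was collected
    advantages:    advantage estimate for each sample
    clip_eps:      clipping range (assumed default, not read from the notebook)
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio pi_new / pi_old, computed from log-probabilities.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - eps, 1 + eps].
        clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Pessimistic bound: take the smaller of the two surrogate terms.
        total += min(ratio * adv, clipped_ratio * adv)
    # Negate because optimizers minimize, while the surrogate is maximized.
    return -total / len(advantages)

# Policy doubled the probability of a good action: the gain is capped at 1 + eps.
loss_good = ppo_clip_loss([math.log(2.0)], [0.0], [1.0])
# Policy doubled the probability of a bad action: the penalty is not capped.
loss_bad = ppo_clip_loss([math.log(2.0)], [0.0], [-1.0])
```

The asymmetry in the two calls is the point of the clipping: improvements beyond the clip range earn no extra credit, while moves that make things worse are still fully penalized.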

Repository files:

.ipynb_checkpoints
__pycache__
A2C PPO.ipynb
README.md
models.py
starpilot agent.torch
starpilot_agent_run.avi
utils.py

LARS12llt/simple-A2C-PPO

Folders and files

Latest commit

History

Repository files navigation

Actor Critic with PPO

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages