Uncertainty Aware RL

🎓 Guided Policy Search

As the title suggests, we learn a neural network policy $\pi_\theta$ that is guided by iLQR trajectory optimization.

We use a Bayesian neural network (BNN) as the dynamics model.
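To illustrate why a BNN is useful as a dynamics model, here is a minimal sketch of how sampling weights yields both a prediction and an uncertainty estimate. It is not this repository's network: the class `TinyBayesianDynamics`, its scalar state and action, the hidden size, and the fixed `weight_std` are all hypothetical, and the weight means are random rather than trained.

```python
import math
import random


class TinyBayesianDynamics:
    """Toy one-hidden-layer model x_next = f(x, u) with independent Gaussian weights.

    Hypothetical illustration only: the weight means are random, not trained.
    """

    def __init__(self, hidden=8, weight_std=0.1, seed=0):
        self.rng = random.Random(seed)
        self.weight_std = weight_std
        # Mean of each weight; every weight gets the same standard deviation.
        self.w1_mean = [[self.rng.gauss(0.0, 1.0) for _ in range(2)]
                        for _ in range(hidden)]
        self.w2_mean = [self.rng.gauss(0.0, 1.0) for _ in range(hidden)]

    def sample_prediction(self, x, u):
        """Draw one set of weights and return the resulting next-state prediction."""
        s = self.weight_std
        hidden = [
            math.tanh(sum((w + self.rng.gauss(0.0, s)) * v
                          for w, v in zip(row, (x, u))))
            for row in self.w1_mean
        ]
        return sum((w + self.rng.gauss(0.0, s)) * h
                   for w, h in zip(self.w2_mean, hidden))

    def predict(self, x, u, num_samples=100):
        """Monte Carlo estimate of the predictive mean and variance."""
        samples = [self.sample_prediction(x, u) for _ in range(num_samples)]
        mean = sum(samples) / num_samples
        var = sum((y - mean) ** 2 for y in samples) / num_samples
        return mean, var


model = TinyBayesianDynamics()
mean, var = model.predict(x=0.5, u=-0.2)
print(f"predicted next state: {mean:.3f}, variance: {var:.5f}")
```

The variance returned by `predict` is the quantity an uncertainty-aware planner could use, for example to penalize trajectories that pass through poorly modeled regions.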

The overall algorithm proceeds as follows:

1. Randomly choose $\pi_{ilqr}$ or $\pi_\theta$ and run it in the environment.
2. Learn the dynamics with the BNN.
3. Update $\pi_{ilqr}$ and $\pi_\theta$ using the BNN.
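The three steps above can be sketched as a loop. This is only a skeleton under stated assumptions: `gps_outer_loop` and every callable passed to it are hypothetical placeholders rather than functions from this repository, and the toy stand-ins (a scalar linear system, a "model" that merely counts transitions, a no-op policy update) exist only so the skeleton runs end to end. Refitting the model on all data collected so far is also an assumption.

```python
import random


def gps_outer_loop(env_step, pi_ilqr, pi_theta, fit_dynamics, improve_policies,
                   x0, num_iters=5, horizon=10, seed=0):
    """Skeleton of the three-step loop; every callable argument is a placeholder."""
    rng = random.Random(seed)
    transitions = []
    model = None
    for _ in range(num_iters):
        # Step 1: randomly pick pi_ilqr or pi_theta and roll it out.
        policy = rng.choice([pi_ilqr, pi_theta])
        x = x0
        for _t in range(horizon):
            u = policy(x)
            x_next = env_step(x, u)
            transitions.append((x, u, x_next))
            x = x_next
        # Step 2: refit the dynamics model (a BNN in this project).
        # Assumption: the model is refit on all data collected so far.
        model = fit_dynamics(transitions)
        # Step 3: update both policies against the learned model.
        pi_ilqr, pi_theta = improve_policies(pi_ilqr, pi_theta, model)
    return pi_ilqr, pi_theta, model, transitions


# Toy stand-ins so the skeleton runs (not the repository's components).
def toy_env_step(x, u):
    return 0.9 * x + 0.5 * u  # scalar linear system


def toy_fit_dynamics(transitions):
    return {"num_transitions": len(transitions)}  # stand-in "model"


def toy_improve_policies(pi_ilqr, pi_theta, model):
    return pi_ilqr, pi_theta  # no-op update


result = gps_outer_loop(toy_env_step, lambda x: -0.5 * x, lambda x: -0.3 * x,
                        toy_fit_dynamics, toy_improve_policies, x0=1.0)
```

In a real run, `fit_dynamics` would train the BNN and `improve_policies` would perform the dual gradient descent update.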

Step 3 is carried out with dual gradient descent, as detailed below.

First, we write the cost in Lagrangian form, $L = f + \lambda \cdot (\text{constraint})$.

We denote this cost by $L(x^{*}(\lambda), \lambda)$.

Here $x^{*}(\lambda)$ stands for the trajectory $\tau$ and the network parameters $\theta$.

The update rule is as follows:

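$$1. \ \tau \leftarrow \arg\min_\tau L(\tau, \theta, \lambda) $$

$$2. \ \theta \leftarrow \arg\min_\theta L(\tau, \theta, \lambda) $$

$$3. \ \lambda \leftarrow \lambda + \alpha \frac{dg}{d\lambda} $$

Here $g(\lambda) = L(x^{*}(\lambda), \lambda)$ is the dual function and $\alpha$ is the step size.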
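As a worked toy example of these three updates, the sketch below runs dual gradient descent on a scalar problem. Everything in it is an assumption made for illustration: the objective $f(\tau, \theta) = (\tau - 3)^2 + \frac{1}{2}\theta^2$, the constraint $\tau - \theta = 0$ (the policy must agree with the trajectory), and the closed-form minimizers. In the actual algorithm, step 1 would be solved by iLQR and step 2 by training the network.

```python
def dual_gradient_descent(alpha=0.5, num_iters=50):
    """Toy dual gradient descent on
        L(tau, theta, lam) = (tau - 3)^2 + 0.5 * theta^2 + lam * (tau - theta).
    The objective and constraint are hypothetical, chosen to have closed forms.
    """
    lam = 0.0
    tau, theta = 0.0, 0.0
    for _ in range(num_iters):
        # Step 1: tau <- argmin_tau L. Setting 2 * (tau - 3) + lam = 0 gives:
        tau = 3.0 - lam / 2.0
        # Step 2: theta <- argmin_theta L. Setting theta - lam = 0 gives:
        theta = lam
        # Step 3: lam <- lam + alpha * dg/dlam. At the minimizer, the gradient
        # of the dual function equals the constraint value tau - theta.
        lam = lam + alpha * (tau - theta)
    return tau, theta, lam


tau, theta, lam = dual_gradient_descent()
print(f"tau={tau:.4f}, theta={theta:.4f}, lambda={lam:.4f}")
```

Under these assumptions, the iterates approach $\tau = \theta = 2$ with $\lambda = 2$, which is also the minimizer of $(x - 3)^2 + \frac{1}{2}x^2$ obtained by substituting the constraint directly.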

🌍 Experiment Environments

CartPole
Hopper

📦 Requirements

Gym
MuJoCo
Python >= 3.8
PyTorch >= 1.12.0
NumPy

📚 Papers & References

iLQR: TassaIROS12
MDGPS: Reset-Free Guided Policy Search: Efficient Deep Reinforcement Learning with Stochastic Initial States
GPS: Guided Policy Search
CS285: Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics

kkugosu/Uncertainty-Aware-RL