Skip to content

Latest commit

 

History

History
362 lines (312 loc) · 16.8 KB

README.md

File metadata and controls

362 lines (312 loc) · 16.8 KB

Awesome RL

Awesome Static Badge Static Badge Static Badge

Contents


Books


Courses


RL Research Topics

  • Approximate Dynamic Programming and Offline RL

    Approximate Dynamic Programming (ADP) concerns obtaining approximate solutions to large planning problems, often with the help of sampling and function approximation. Many ADP methods can be considered as prototype algorithms for popular value-based RL algorithms used today, especially in the offline setting, so it is important to understand their behaviors and guarantees.

    • Online + Offline (Hybrid)

      • Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
      • Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data
      • Hybrid rl: Using both offline and online data can make rl efficient
  • Multi-agent RL

  • Off-policy Evaluation

    How to estimate the performance of a policy using data collected from a different policy? This question has important implications in safety and real-world applications of RL.


GitHub Repo


Website


Activity


Application


Community


Conference & Journal

Conference: NIPS, ICML, ICLR, AAAI, IJCAI, AAMAS, IROS, etc.

Journal: JMLR, JAIR, JAAMAS, etc.


Research Group

Other outer link

⬆ back to top


Industry Group

⬆ back to top


Misc

⬆ back to top


Discussion

  1. Policy-based vs. Value-based [ZhiHu]
  2. Philosophy of Reinforcement Learning

⬆ back to top


Contributing

This is an active repository and it is time-consuming to maintain the content. So your contributions really matter!

If you find it helpful, please vote for it by adding 👍.

If you have any question about this list, do not hesitate to contact me at [email protected].

⬆ back to top


Reference

⬆ back to top