This is a resource about game playing with reinforcement learning, game theory and so on.
This project will gather newest papers and classify them gradually.(f.e. neurips 2018)
It will be updated continuously, welcome to contribute to us!.
focus on texas hold'em and so on
include alpha go series
- Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion. paper
- Data-efficient model-based reinforcement learning with deep probabilistic dynamics models. paper
- Iterative Value-Aware Model Learning. paper
- Data center cooling using model-predictive control. paper
- Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation. paper
- Representation Balancing MDPs for Off-Policy Policy Evaluation. paper
- Simple random search provides a competitive approach to reinforcement learning. paper
- Non-delusional Q-learning and value iteration. paper
- Actor-Critic Policy Optimization in Partially Observable Multiagent Environments. paper
- Learning Temporal Point Processes via Reinforcement Learning. paper
- Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing. paper code
- Flexible Neural Representation for Physics Prediction. paper
- Data-Efficient Hierarchical Reinforcement Learning. paper
- Learning Abstract Options. paper
- Dual Policy Iteration. paper
- differentiable mpc for end-to-end planning and control. paper
- Learning Plannable Representations with Causal InfoGAN. paper
- Multi-Agent Generative Adversarial Imitation Learning. paper
- an event-based framework for task specification and control. paper
- Context-Dependent Upper-Confidence Bounds for Directed Exploration. paper
- Playing hard exploration games by watching YouTube. paper
- Unsupervised Video Object Segmentation for Deep Reinforcement Learning. paper
- Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation. paper
- Visual Reinforcement Learning with Imagined Goals. paper
- Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding. paper
- Meta-Reinforcement Learning of Structured Exploration Strategies. paper
- Evolved Policy Gradients. paper
- Neural Arithmetic Logic Units. paper
- Meta-Learning MCMC Proposals. paper
- Probabilistic Model-Agnostic Meta-Learning. paper
- Meta-Gradient Reinforcement Learning. paper
- Deep Generative Models with Learnable Knowledge Constraints. paper
- Are GANs Created Equal? A Large-Scale Study. paper
- Human-in-the-Loop Interpretability Prior. paper
- Towards Robust Interpretability with Self-Explaining Neural Networks. paper
- end-to-end differentiable physics for learning and control. paper code
- Recurrent World Models Facilitate Policy Evolution. paper
- Learning to Play with Intrinsically-Motivated Self-Aware Agents. paper
- Reward learning from human preferences and demonstrations in Atari. paper
- On Learning Intrinsic Rewards for Policy Gradient Methods. paper
- DeepProbLog: Neural Probabilistic Logic Programming. paper
- Scalable End-to-End Autonomous Vehicle Testing via Rare-event Simulation. paper
- Relational recurrent neural networks. paper code
- How Does Batch Normalization Help Optimization? paper
- Randomized Prior Functions for Deep Reinforcement Learning. paper
- Transfer Learning with Neural AutoML. paper
- Neural Guided Constraint Logic Programming for Program Synthesis. paper code