Deep Reinforcement Learning with Smooth Policy Jan 1, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Improving the Generalization of Visual Navigation Policies using Invariance Regularization Jan 1, 2020 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning Jan 1, 2020 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning Jan 1, 2020 Benchmarking reinforcement-learning
Code Code Available 0Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation Jan 1, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Deep Reinforcement Learning with Implicit Human Feedback Jan 1, 2020 Atari Games Deep Reinforcement Learning
— Unverified 0OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning Jan 1, 2020 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Reinforcement Learning with Goal-Distance Gradient Jan 1, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Optimizing Multiagent Cooperation via Policy Evolution and Shared Experiences Jan 1, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0“Other-Play” for Zero-Shot Coordination Jan 1, 2020 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Way Off-Policy Batch Deep Reinforcement Learning of Human Preferences in Dialog Jan 1, 2020 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Reinforcement Learning with Differential Privacy Jan 1, 2020 Decision Making Privacy Preserving
— Unverified 0Responsive Safety in Reinforcement Learning Jan 1, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0The Natural Lottery Ticket Winner: Reinforcement Learning with Ordinary Neural Circuits Jan 1, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0SVQN: Sequential Variational Soft Q-Learning Networks Jan 1, 2020 Decision Making Q-Learning
— Unverified 0Reward-Conditioned Policies Dec 31, 2019 Imitation Learning reinforcement-learning
Code Code Available 0Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning Dec 31, 2019 Bayesian Inference Classification
— Unverified 0The Gambler's Problem and Beyond Dec 31, 2019 Q-Learning reinforcement-learning
— Unverified 0Information Theoretic Model Predictive Q-Learning Dec 31, 2019 Decision Making model
— Unverified 0A New Framework for Query Efficient Active Imitation Learning Dec 30, 2019 Imitation Learning Reinforcement Learning
— Unverified 0Deep Reinforced Self-Attention Masks for Abstractive Summarization (DR.SAS) Dec 30, 2019 Abstractive Text Summarization Reinforcement Learning
— Unverified 0World Programs for Model-Based Learning and Planning in Compositional State and Action Spaces Dec 30, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Speeding up reinforcement learning by combining attention and agency features Dec 29, 2019 Atari Games reinforcement-learning
— Unverified 0Real-time Policy Distillation in Deep Reinforcement Learning Dec 29, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Augmented Replay Memory in Reinforcement Learning With Continuous Control Dec 29, 2019 continuous-control Continuous Control
— Unverified 0Individual specialization in multi-task environments with multiagent reinforcement learners Dec 29, 2019 Fairness Multi-agent Reinforcement Learning
— Unverified 0Computational model discovery with reinforcement learning Dec 29, 2019 Deep Reinforcement Learning model
— Unverified 0SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning Dec 28, 2019 Atari Games Deep Reinforcement Learning
Code Code Available 0Weak Supervision for Fake News Detection via Reinforcement Learning Dec 28, 2019 Articles Fake News Detection
Code Code Available 0Quantum Logic Gate Synthesis as a Markov Decision Process Dec 27, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Evolution Strategies Converges to Finite Differences Dec 27, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Crowdfunding Dynamics Tracking: A Reinforcement Learning Approach Dec 27, 2019 continuous-control Continuous Control
— Unverified 0Deep reinforcement learning for complex evaluation of one-loop diagrams in quantum field theory Dec 27, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Quasi-Newton Trust Region Policy Optimization Dec 26, 2019 continuous-control Continuous Control
— Unverified 0Learning to Combat Compounding-Error in Model-Based Reinforcement Learning Dec 24, 2019 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Learning to Navigate Using Mid-Level Visual Priors Dec 23, 2019 Navigate reinforcement-learning
Code Code Available 0A Survey of Deep Reinforcement Learning in Video Games Dec 23, 2019 Deep Reinforcement Learning Real-Time Strategy Games
— Unverified 0Discrete and Continuous Action Representation for Practical RL in Video Games Dec 23, 2019 Control with Prametrised Actions Reinforcement Learning
Code Code Available 0Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time Dec 23, 2019 Q-Learning reinforcement-learning
— Unverified 0Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution Dec 23, 2019 Atari Games Board Games
Code Code Available 0Direct and indirect reinforcement learning Dec 23, 2019 Decision Making reinforcement-learning
— Unverified 0Variational Recurrent Models for Solving Partially Observable Control Tasks Dec 23, 2019 Deep Reinforcement Learning Memorization
Code Code Available 0Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning Dec 23, 2019 Efficient Exploration reinforcement-learning
Code Code Available 0Towards Practical Multi-Object Manipulation using Relational Reinforcement Learning Dec 23, 2019 Object reinforcement-learning
Code Code Available 0Monte-Carlo Tree Search for Policy Optimization Dec 23, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Energy-Aware Multi-Server Mobile Edge Computing: A Deep Reinforcement Learning Approach Dec 22, 2019 Deep Reinforcement Learning Edge-computing
— Unverified 0Can Agents Learn by Analogy? An Inferable Model for PAC Reinforcement Learning Dec 21, 2019 Model-based Reinforcement Learning reinforcement-learning
Code Code Available 0Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards Dec 21, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes Dec 21, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Teaching robots to perceive time -- A reinforcement learning approach (Extended version) Dec 20, 2019 Gaussian Processes reinforcement-learning
— Unverified 0