Title | Date | Tags | Code status
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games | Jul 4, 2017 | Atari Games, GPU | Code available
Maintaining cooperation in complex social dilemmas using deep reinforcement learning | Jul 4, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified
OPEB: Open Physical Environment Benchmark for Artificial Intelligence | Jul 4, 2017 | continuous-control, Continuous Control | Unverified
Efficient Probabilistic Performance Bounds for Inverse Reinforcement Learning | Jul 3, 2017 | reinforcement-learning, Reinforcement Learning | Code available
Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning | Jul 3, 2017 | Deep Reinforcement Learning, Efficient Exploration | Unverified
Grammatical Error Correction with Neural Reinforcement Learning | Jul 2, 2017 | Decoder, Grammatical Error Correction | Unverified
Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning | Jul 1, 2017 | Deep Reinforcement Learning, GPU | Code available
Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management | Jul 1, 2017 | Deep Reinforcement Learning, Dialogue Management | Unverified
Neural Sequence Model Training via α-divergence Minimization | Jun 30, 2017 | Machine Translation, model | Code available
Noisy Networks for Exploration | Jun 30, 2017 | Atari Games, Deep Reinforcement Learning | Code available
Neural SLAM: Learning to Explore with External Memory | Jun 29, 2017 | Reinforcement Learning, Reinforcement Learning (RL) | Code available
Path Integral Networks: End-to-End Differentiable Optimal Control | Jun 29, 2017 | continuous-control, Continuous Control | Unverified
Actor-Critic Sequence Training for Image Captioning | Jun 29, 2017 | AI Agent, Image Captioning | Unverified
Learning to Learn: Meta-Critic Networks for Sample Efficient Learning | Jun 29, 2017 | Meta-Learning, reinforcement-learning | Unverified
Interpretability via Model Extraction | Jun 29, 2017 | BIG-bench Machine Learning, model | Unverified
Uncertainty Decomposition in Bayesian Neural Networks with Latent Variables | Jun 26, 2017 | Active Learning, reinforcement-learning | Unverified
Count-Based Exploration in Feature Space for Reinforcement Learning | Jun 25, 2017 | Atari Games, Efficient Exploration | Code available
Temporal-related Convolutional-Restricted-Boltzmann-Machine capable of learning relational order via reinforcement learning procedure? | Jun 24, 2017 | reinforcement-learning, Reinforcement Learning | Unverified
A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning | Jun 22, 2017 | Action Detection, Position | Code available
Structure Learning in Motor Control: A Deep Reinforcement Learning Model | Jun 21, 2017 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Unverified
Observational Learning by Reinforcement Learning | Jun 20, 2017 | reinforcement-learning, Reinforcement Learning | Unverified
Toward Real-Time Decentralized Reinforcement Learning using Finite Support Basis Functions | Jun 20, 2017 | reinforcement-learning, Reinforcement Learning | Unverified
Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines | Jun 20, 2017 | Policy Gradient Methods, reinforcement-learning | Unverified
Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control | Jun 20, 2017 | Gaussian Processes, Model Predictive Control | Code available
Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning | Jun 19, 2017 | Continual Learning, Deep Reinforcement Learning | Code available
Pedestrian Prediction by Planning using Deep Neural Networks | Jun 19, 2017 | Autonomous Vehicles, Collision Avoidance | Unverified
Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning | Jun 19, 2017 | Dialogue Management, Hierarchical Reinforcement Learning | Unverified
Reinforcement Learning under Model Mismatch | Jun 15, 2017 | model, Q-Learning | Unverified
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning | Jun 15, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code available
Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations | Jun 15, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code available
Reinforcement Learning with Budget-Constrained Nonparametric Function Approximation for Opportunistic Spectrum Access | Jun 14, 2017 | reinforcement-learning, Reinforcement Learning | Unverified
On Optimistic versus Randomized Exploration in Reinforcement Learning | Jun 13, 2017 | Computational Efficiency, reinforcement-learning | Unverified
Hybrid Reward Architecture for Reinforcement Learning | Jun 13, 2017 | reinforcement-learning, Reinforcement Learning | Code available
Device Placement Optimization with Reinforcement Learning | Jun 13, 2017 | Language Modeling, Language Modelling | Code available
Deep reinforcement learning from human preferences | Jun 12, 2017 | Atari Games, Deep Reinforcement Learning | Code available
ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with Deep Multi-agent Reinforcement Learning | Jun 10, 2017 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified
Symmetry Learning for Function Approximation in Reinforcement Learning | Jun 9, 2017 | reinforcement-learning, Reinforcement Learning | Unverified
Unlocking the Potential of Simulators: Design with RL in Mind | Jun 8, 2017 | Decision Making, Friction | Unverified
Efficient Reinforcement Learning via Initial Pure Exploration | Jun 7, 2017 | Multi-Armed Bandits, reinforcement-learning | Unverified
Parameter Space Noise for Exploration | Jun 6, 2017 | continuous-control, Continuous Control | Code available
Towards Synthesizing Complex Programs from Input-Output Examples | Jun 5, 2017 | Program Synthesis, reinforcement-learning | Unverified
UCB Exploration via Q-Ensembles | Jun 5, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified
A method for the online construction of the set of states of a Markov Decision Process using Answer Set Programming | Jun 5, 2017 | Decision Making, Reinforcement Learning | Unverified
Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics | Jun 4, 2017 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning | Jun 1, 2017 | continuous-control, Continuous Control | Unverified
Reinforcement Learning for Learning Rate Control | May 31, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified
The Atari Grand Challenge Dataset | May 31, 2017 | Imitation Learning, Reinforcement Learning | Code available
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time Budget | May 31, 2017 | Decision Making, Feature Engineering | Unverified
Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models | May 30, 2017 | Molecular Graph Generation, Music Generation | Code available
Universal Reinforcement Learning Algorithms: Survey and Experiments | May 30, 2017 | reinforcement-learning, Reinforcement Learning | Code available