Deep Pepper: Expert Iteration based Chess agent in the Reinforcement Learning Setting Jun 2, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Efficient Entropy for Policy Gradient with Multidimensional Action Space Jun 2, 2018 Atari Games Deep Reinforcement Learning
— Unverified 0Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition Jun 1, 2018 Action Recognition Deep Reinforcement Learning
— Unverified 0Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning Jun 1, 2018 Model-based Reinforcement Learning Object
— Unverified 0Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling Jun 1, 2018 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Bootstrapping a Neural Conversational Agent with Dialogue Self-Play, Crowdsourcing and On-Line Reinforcement Learning Jun 1, 2018 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Improved Sample Complexity for Stochastic Compositional Variance Reduced Gradient Jun 1, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Inference Aided Reinforcement Learning for Incentive Mechanism Design in Crowdsourcing Jun 1, 2018 Bayesian Inference reinforcement-learning
— Unverified 0A Reinforcement Learning Approach to Age of Information in Multi-User Networks Jun 1, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Deep Reinforcement Learning of Region Proposal Networks for Object Detection Jun 1, 2018 Deep Reinforcement Learning Object
Code Code Available 0Environment Upgrade Reinforcement Learning for Non-Differentiable Multi-Stage Pipelines Jun 1, 2018 Instance Segmentation Pose Estimation
— Unverified 0GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning Jun 1, 2018 Binarization Deep Reinforcement Learning
— Unverified 0Equivalence Between Wasserstein and Value-Aware Loss for Model-based Reinforcement Learning Jun 1, 2018 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0SeedNet: Automatic Seed Generation With Deep Reinforcement Learning for Robust Interactive Segmentation Jun 1, 2018 Deep Reinforcement Learning Interactive Segmentation
— Unverified 0Mining Evidences for Concept Stock Recommendation Jun 1, 2018 Deep Reinforcement Learning Information Retrieval
— Unverified 0Quality Signals in Generated Stories Jun 1, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Sequential Attacks on Agents for Long-Term Adversarial Goals May 31, 2018 Adversarial Attack Reinforcement Learning
— Unverified 0Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation May 31, 2018 Image-to-Image Translation Imitation Learning
Code Code Available 0Reinforced Continual Learning May 31, 2018 Continual Learning General Classification
Code Code Available 0Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update May 31, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Learning a Prior over Intent via Meta-Inverse Reinforcement Learning May 31, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Evaluating Reinforcement Learning Algorithms in Observational Health Settings May 31, 2018 BIG-bench Machine Learning Decision Making
— Unverified 0Adversarial Learning of Task-Oriented Neural Dialog Models May 30, 2018 Dialog Learning Reinforcement Learning
— Unverified 0Bayesian Inference with Anchored Ensembles of Neural Networks, and Application to Exploration in Reinforcement Learning May 29, 2018 Bayesian Inference reinforcement-learning
Code Code Available 0Depth and nonlinearity induce implicit exploration for RL May 29, 2018 Q-Learning reinforcement-learning
— Unverified 0Observe and Look Further: Achieving Consistent Performance on Atari May 29, 2018 Atari Games Deep Reinforcement Learning
— Unverified 0Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition May 29, 2018 continuous-control Continuous Control
— Unverified 0Supervised Policy Update for Deep Reinforcement Learning May 29, 2018 Deep Reinforcement Learning MuJoCo
Code Code Available 0Virtuously Safe Reinforcement Learning May 29, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning May 29, 2018 Imitation Learning reinforcement-learning
— Unverified 0Value Propagation Networks May 28, 2018 Navigate reinforcement-learning
— Unverified 0Memory Augmented Self-Play May 28, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Hierarchical clustering with deep Q-learning May 28, 2018 Clustering Q-Learning
— Unverified 0Importance Weighted Transfer of Samples in Reinforcement Learning May 28, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Fingerprint Policy Optimisation for Robust Reinforcement Learning May 27, 2018 Bayesian Optimisation Continuous Control
— Unverified 0Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning May 27, 2018 Machine Translation NMT
Code Code Available 0Fast Policy Learning through Imitation and Reinforcement May 26, 2018 Imitation Learning Reinforcement Learning
— Unverified 0Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation May 26, 2018 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Finite Sample Analysis of LSTD with Random Projections and Eligibility Traces May 25, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Detecting Deceptive Reviews using Generative Adversarial Networks May 25, 2018 General Classification Reinforcement Learning
— Unverified 0A Sliding-Window Algorithm for Markov Decision Processes with Arbitrarily Changing Rewards and Transitions May 25, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning May 25, 2018 Imitation Learning reinforcement-learning
Code Code Available 0Myopic Bayesian Design of Experiments via Posterior Sampling and Probabilistic Programming May 25, 2018 Bayesian Inference Multi-Armed Bandits
Code Code Available 0Reinforced Extractive Summarization with Question-Focused Rewards May 25, 2018 Extractive Summarization reinforcement-learning
— Unverified 0Visceral Machines: Risk-Aversion in Reinforcement Learning with Intrinsic Physiological Rewards May 25, 2018 Navigate reinforcement-learning
Code Code Available 0Resource Allocation for a Wireless Coexistence Management System Based on Reinforcement Learning May 24, 2018 Management reinforcement-learning
— Unverified 0Meta-Gradient Reinforcement Learning May 24, 2018 Meta-Learning reinforcement-learning
Code Code Available 0Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning May 24, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0A0C: Alpha Zero in Continuous Action Space May 24, 2018 Board Games reinforcement-learning
Code Code Available 0Intelligent Trainer for Model-Based Reinforcement Learning May 24, 2018 model Model-based Reinforcement Learning
Code Code Available 0