Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application | Mar 2, 2018 | Decision Making, Learning-To-Rank | Code Available 0
Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach | Mar 1, 2018 | reinforcement-learning, Reinforcement Learning | Unverified 0
Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling | Mar 1, 2018 | Active Learning, reinforcement-learning | Unverified 0
Deep Reinforcement Learning for Sponsored Search Real-time Bidding | Mar 1, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified 0
On Oracle-Efficient PAC RL with Rich Observations | Mar 1, 2018 | reinforcement-learning, Reinforcement Learning | Unverified 0
Hierarchical Imitation and Reinforcement Learning | Mar 1, 2018 | Decision Making, Imitation Learning | Unverified 0
Learning by Playing - Solving Sparse Reward Tasks from Scratch | Feb 28, 2018 | reinforcement-learning, Reinforcement Learning | Code Available 0
Model-Ensemble Trust-Region Policy Optimization | Feb 28, 2018 | continuous-control, Continuous Control | Code Available 0
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning | Feb 28, 2018 | continuous-control, Continuous Control | Unverified 0
Deep Reinforcement Learning for Join Order Enumeration | Feb 28, 2018 | Decision Making, Deep Reinforcement Learning | Unverified 0
Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods | Feb 28, 2018 | Deep Reinforcement Learning, Diversity | Code Available 0
DiGrad: Multi-Task Reinforcement Learning with Shared Actions | Feb 27, 2018 | Multi-Task Learning, reinforcement-learning | Unverified 0
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising | Feb 27, 2018 | Clustering, Multi-agent Reinforcement Learning | Unverified 0
The Mirage of Action-Dependent Baselines in Reinforcement Learning | Feb 27, 2018 | Policy Gradient Methods, reinforcement-learning | Code Available 0
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | Feb 26, 2018 | Decision Making, Deep Reinforcement Learning | Code Available 0
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research | Feb 26, 2018 | continuous-control, Continuous Control | Code Available 0
Modeling Others using Oneself in Multi-Agent Reinforcement Learning | Feb 26, 2018 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified 0
Variance Reduction Methods for Sublinear Reinforcement Learning | Feb 26, 2018 | Q-Learning, reinforcement-learning | Unverified 0
Reinforcement and Imitation Learning for Diverse Visuomotor Skills | Feb 26, 2018 | Deep Reinforcement Learning, Imitation Learning | Code Available 0
Addressing Function Approximation Error in Actor-Critic Methods | Feb 26, 2018 | Continuous Control, OpenAI Gym | Code Available 1
Temporal Difference Models: Model-Free Deep RL for Model-Based Control | Feb 25, 2018 | continuous-control, Continuous Control | Unverified 0
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration | Feb 24, 2018 | reinforcement-learning, Reinforcement Learning | Code Available 1
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari | Feb 24, 2018 | Atari Games, Benchmarking | Code Available 0
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents | Feb 23, 2018 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available 1
Budget Constrained Bidding by Model-free Reinforcement Learning in Display Advertising | Feb 23, 2018 | Marketing, reinforcement-learning | Unverified 0
Verifying Controllers Against Adversarial Examples with Bayesian Optimization | Feb 23, 2018 | Bayesian Optimization, reinforcement-learning | Code Available 0
Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments | Feb 23, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified 0
Ranking Sentences for Extractive Summarization with Reinforcement Learning | Feb 23, 2018 | Document Summarization, Extractive Summarization | Code Available 0
Structured Control Nets for Deep Reinforcement Learning | Feb 22, 2018 | Decision Making, Deep Reinforcement Learning | Code Available 0
An Analysis of Categorical Distributional Reinforcement Learning | Feb 22, 2018 | Distributional Reinforcement Learning, reinforcement-learning | Unverified 0
Diverse Exploration for Fast and Safe Policy Improvement | Feb 22, 2018 | Diversity, reinforcement-learning | Unverified 0
Variational Inference for Policy Gradient | Feb 21, 2018 | reinforcement-learning, Reinforcement Learning | Unverified 0
Meta-Reinforcement Learning of Structured Exploration Strategies | Feb 20, 2018 | Meta Reinforcement Learning, reinforcement-learning | Code Available 1
Continual Reinforcement Learning with Complex Synapses | Feb 20, 2018 | Continual Learning, Deep Reinforcement Learning | Unverified 0
Accelerated Primal-Dual Policy Optimization for Safe Reinforcement Learning | Feb 19, 2018 | reinforcement-learning, Reinforcement Learning | Unverified 0
Fourier Policy Gradients | Feb 19, 2018 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified 0
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning | Feb 19, 2018 | Deep Reinforcement Learning, Recommendation Systems | Unverified 0
Improving Mild Cognitive Impairment Prediction via Reinforcement Learning and Dialogue Simulation | Feb 18, 2018 | reinforcement-learning, Reinforcement Learning | Unverified 0
Estimating scale-invariant future in continuous time | Feb 18, 2018 | reinforcement-learning, Reinforcement Learning | Unverified 0
Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Feb 18, 2018 | Deep Reinforcement Learning, Management | Code Available 0
Bridging Cognitive Programs and Machine Learning | Feb 16, 2018 | BIG-bench Machine Learning, reinforcement-learning | Unverified 0
Modeling the Formation of Social Conventions from Embodied Real-Time Interactions | Feb 16, 2018 | Decision Making, Fairness | Unverified 0
Reactive Reinforcement Learning in Asynchronous Environments | Feb 16, 2018 | Decision Making, reinforcement-learning | Unverified 0
Monte Carlo Q-learning for General Game Playing | Feb 16, 2018 | Board Games, Q-Learning | Code Available 0
Diversity is All You Need: Learning Skills without a Reward Function | Feb 16, 2018 | All, Diversity | Code Available 1
Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays | Feb 15, 2018 | Hippocampus, Q-Learning | Unverified 0
Mean Field Multi-Agent Reinforcement Learning | Feb 15, 2018 | Multi-agent Reinforcement Learning, Q-Learning | Code Available 1
Reinforcement Learning from Imperfect Demonstrations | Feb 14, 2018 | reinforcement-learning, Reinforcement Learning | Unverified 0
From Gameplay to Symbolic Reasoning: Learning SAT Solver Heuristics in the Style of Alpha(Go) Zero | Feb 14, 2018 | Decision Making, Deep Reinforcement Learning | Code Available 0
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms | Feb 14, 2018 | Deep Reinforcement Learning, Diversity | Code Available 0