Near-optimal Regret Bounds for Stochastic Shortest Path | Feb 23, 2020 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0
Rapidly Personalizing Mobile Health Treatment Policies with Limited Data | Feb 23, 2020 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0
Optimizing Traffic Lights with Multi-agent Deep Reinforcement Learning and V2X communication | Feb 23, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0
Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations | Feb 23, 2020 | Atari Games, Decision Making | Code Available | 1
Deep Reinforcement Learning with Linear Quadratic Regulator Regions | Feb 23, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0
Adversarial Radar Inference. From Inverse Tracking to Inverse Reinforcement Learning of Cognitive Radar | Feb 22, 2020 | Reinforcement Learning (RL), Stochastic Optimization | Unverified | 0
Automatic Data Augmentation via Deep Reinforcement Learning for Effective Kidney Tumor Segmentation | Feb 22, 2020 | Data Augmentation, Deep Reinforcement Learning | Unverified | 0
Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion | Feb 22, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0
Vehicle Tracking in Wireless Sensor Networks via Deep Reinforcement Learning | Feb 22, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0
Reinforcement Learning Framework for Deep Brain Stimulation Study | Feb 22, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 1
Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep Reinforcement Learning Approach | Feb 21, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0
On the Search for Feedback in Reinforcement Learning | Feb 21, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning | Feb 21, 2020 | Atari Games, Object | Unverified | 0
Accelerating Reinforcement Learning with a Directional-Gaussian-Smoothing Evolution Strategy | Feb 21, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Automatic Gesture Recognition in Robot-assisted Surgery with Reinforcement Learning and Tree Search | Feb 20, 2020 | Action Segmentation, Gesture Recognition | Unverified | 0
Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning | Feb 20, 2020 | Autonomous Navigation, Deep Reinforcement Learning | Unverified | 0
Adaptive Temporal Difference Learning with Linear Function Approximation | Feb 20, 2020 | OpenAI Gym, reinforcement-learning | Unverified | 0
oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions | Feb 20, 2020 | continuous-control, Continuous Control | Unverified | 0
Multi-Agent Reinforcement Learning as a Computational Tool for Language Evolution Research: Historical Context and Future Challenges | Feb 20, 2020 | BIG-bench Machine Learning, Multi-agent Reinforcement Learning | Unverified | 0
Multi-Agent Meta-Reinforcement Learning for Self-Powered and Sustainable Edge Computing Systems | Feb 20, 2020 | Edge-computing, Meta Reinforcement Learning | Unverified | 0
Debiased Off-Policy Evaluation for Recommendation Systems | Feb 20, 2020 | counterfactual, Off-policy evaluation | Unverified | 0
UAV Aided Search and Rescue Operation Using Reinforcement Learning | Feb 19, 2020 | Q-Learning, reinforcement-learning | Unverified | 0
Sim2Real Transfer for Reinforcement Learning without Dynamics Randomization | Feb 19, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 1
Value-driven Hindsight Modelling | Feb 19, 2020 | Atari Games, Reinforcement Learning | Unverified | 0
Optimistic Policy Optimization with Bandit Feedback | Feb 19, 2020 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer | Feb 19, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0
Curriculum in Gradient-Based Meta-Reinforcement Learning | Feb 19, 2020 | Benchmarking, Meta-Learning | Unverified | 0
How To Avoid Being Eaten By a Grue: Exploration Strategies for Text-Adventure Agents | Feb 19, 2020 | Knowledge Graphs, reinforcement-learning | Code Available | 1
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning | Feb 19, 2020 | continuous-control, Continuous Control | Unverified | 0
Generating Automatic Curricula via Self-Supervised Active Domain Randomization | Feb 18, 2020 | Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1
Empirical Policy Evaluation with Supergraphs | Feb 18, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Adaptive Estimator Selection for Off-Policy Evaluation | Feb 18, 2020 | Multi-Armed Bandits, Off-policy evaluation | Code Available | 0
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge | Feb 18, 2020 | Common Sense Reasoning, continuous-control | Unverified | 0
MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding | Feb 18, 2020 | Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0
Reinforcement Learning for Molecular Design Guided by Quantum Mechanics | Feb 18, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1
Multi-Issue Bargaining With Deep Reinforcement Learning | Feb 18, 2020 | continuous-control, Continuous Control | Unverified | 0
Langevin DQN | Feb 17, 2020 | Computational Efficiency, Open-Ended Question Answering | Code Available | 0
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking | Feb 17, 2020 | Gaussian Processes, Reinforcement Learning | Code Available | 1
Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning | Feb 17, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 0
Adaptive Experience Selection for Policy Gradient | Feb 17, 2020 | continuous-control, Continuous Control | Unverified | 0
Reinforcement learning for the privacy preservation and manipulation of eye tracking data | Feb 17, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Reward Design for Driver Repositioning Using Multi-Agent Reinforcement Learning | Feb 17, 2020 | Bayesian Optimization, Bilevel Optimization | Unverified | 0
Reinforced active learning for image segmentation | Feb 16, 2020 | Active Learning, Deep Reinforcement Learning | Code Available | 1
R-MADDPG for Partially Observable Environments and Limited Communication | Feb 16, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 1
Investigating Simple Object Representations in Model-Free Deep Reinforcement Learning | Feb 16, 2020 | Deep Reinforcement Learning, Object | Unverified | 0
The Archimedean trap: Why traditional reinforcement learning will probably not yield AGI | Feb 15, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling | Feb 15, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0
PDDLGym: Gym Environments from PDDL Problems | Feb 15, 2020 | Decision Making, OpenAI Gym | Code Available | 1
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning | Feb 15, 2020 | Density Estimation, Imitation Learning | Code Available | 0
Deep RL Agent for a Real-Time Action Strategy Game | Feb 15, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1