Zeroth-Order Supervised Policy Improvement | Jun 11, 2020 | continuous-control Continuous Control | Unverified
Surveys without Questions: A Reinforcement Learning Approach | Jun 11, 2020 | reinforcement-learning Reinforcement Learning | Unverified
Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation | Jun 11, 2020 | Learning Theory reinforcement-learning | Unverified
Multi-Agent Informational Learning Processes | Jun 11, 2020 | Multi-agent Reinforcement Learning reinforcement-learning | Unverified
Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward | Jun 11, 2020 | Multi-agent Reinforcement Learning reinforcement-learning | Unverified
Off-Policy Risk-Sensitive Reinforcement Learning Based Constrained Robust Optimal Control | Jun 10, 2020 | reinforcement-learning Reinforcement Learning (RL) | Unverified
Q-greedyUCB: a New Exploration Policy for Adaptive and Resource-efficient Scheduling | Jun 10, 2020 | Decision Making Q-Learning | Unverified
Continuous Action Reinforcement Learning from a Mixture of Interpretable Experts | Jun 10, 2020 | reinforcement-learning Reinforcement Learning | Code Available
Machine learning and control engineering: The model-free case | Jun 10, 2020 | BIG-bench Machine Learning reinforcement-learning | Unverified
Searching Learning Strategy with Reinforcement Learning for 3D Medical Image Segmentation | Jun 10, 2020 | Data Augmentation Image Segmentation | Unverified
Privacy-Cost Management in Smart Meters with Mutual Information-Based Reinforcement Learning | Jun 10, 2020 | Deep Reinforcement Learning Management | Unverified
Multi-Agent Reinforcement Learning in a Realistic Limit Order Book Market Simulation | Jun 10, 2020 | Multi-agent Reinforcement Learning Q-Learning | Unverified
Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning | Jun 10, 2020 | Deep Reinforcement Learning reinforcement-learning | Unverified
Self-Supervised Reinforcement Learning for Recommender Systems | Jun 10, 2020 | Q-Learning Recommendation Systems | Unverified
Deep reinforcement learning for optical systems: A case study of mode-locked lasers | Jun 10, 2020 | Deep Reinforcement Learning Navigate | Unverified
Learning to Play Table Tennis From Scratch using Muscular Robots | Jun 10, 2020 | reinforcement-learning Reinforcement Learning (RL) | Unverified
Development of A Stochastic Traffic Environment with Generative Time-Series Models for Improving Generalization Capabilities of Autonomous Driving Agents | Jun 10, 2020 | Autonomous Driving Reinforcement Learning (RL) | Unverified
Causal Discovery from Incomplete Data using An Encoder and Reinforcement Learning | Jun 9, 2020 | Causal Discovery Imputation | Unverified
An overall view of key problems in algorithmic trading and recent progress | Jun 9, 2020 | Algorithmic Trading BIG-bench Machine Learning | Unverified
Distributed Learning on Heterogeneous Resource-Constrained Devices | Jun 9, 2020 | Federated Learning Reinforcement Learning (RL) | Unverified
Stealing Deep Reinforcement Learning Models for Fun and Profit | Jun 9, 2020 | Decision Making Deep Reinforcement Learning | Unverified
Policy-focused Agent-based Modeling using RL Behavioral Models | Jun 9, 2020 | Decision Making Reinforcement Learning (RL) | Unverified
Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior | Jun 9, 2020 | Multi-Armed Bandits reinforcement-learning | Code Available
Variational Model-based Policy Optimization | Jun 9, 2020 | continuous-control Continuous Control | Unverified
Online Data Poisoning Attacks | Jun 8, 2020 | Data Poisoning Deep Reinforcement Learning | Unverified
Randomized Policy Learning for Continuous State and Action MDPs | Jun 8, 2020 | Deep Reinforcement Learning Reinforcement Learning (RL) | Unverified
Tools for Data-driven Modeling of Within-Hand Manipulation with Underactuated Adaptive Hands | Jun 8, 2020 | Model-based Reinforcement Learning Reinforcement Learning (RL) | Code Available
Stable Reinforcement Learning with Unbounded State Space | Jun 8, 2020 | reinforcement-learning Reinforcement Learning | Unverified
Scalable Reinforcement Learning of Localized Policies for Multi-Agent Networked Systems | Jun 8, 2020 | reinforcement-learning Reinforcement Learning (RL) | Unverified
Policy Optimization for H_2 Linear Control with H_∞ Robustness Guarantee: Implicit Regularization and Global Convergence | Jun 8, 2020 | Reinforcement Learning (RL) | Unverified
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors | Jun 8, 2020 | model Model-based Reinforcement Learning | Unverified
Hallucinating Value: A Pitfall of Dyna-style Planning with Imperfect Environment Models | Jun 8, 2020 | Reinforcement Learning (RL) | Unverified
Learning to Plan via Deep Optimistic Value Exploration | Jun 8, 2020 | Benchmarking Model-based Reinforcement Learning | Unverified
Balancing a CartPole System with Reinforcement Learning -- A Tutorial | Jun 8, 2020 | OpenAI Gym Q-Learning | Unverified
A Decentralized Policy Gradient Approach to Multi-task Reinforcement Learning | Jun 8, 2020 | Atari Games Multi-Task Learning | Unverified
A Comparison of Self-Play Algorithms Under a Generalized Framework | Jun 8, 2020 | Reinforcement Learning (RL) | Unverified
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret | Jun 8, 2020 | Q-Learning reinforcement-learning | Unverified
Constrained Upper Confidence Reinforcement Learning with Known Dynamics | Jun 8, 2020 | reinforcement-learning Reinforcement Learning | Unverified
Learning the model-free linear quadratic regulator via random search | Jun 8, 2020 | Reinforcement Learning (RL) | Unverified
Dual Policy Distillation | Jun 7, 2020 | continuous-control Continuous Control | Code Available
Implications of Human Irrationality for Reinforcement Learning | Jun 7, 2020 | BIG-bench Machine Learning Decision Making | Unverified
Efficient Poverty Mapping using Deep Reinforcement Learning | Jun 7, 2020 | Deep Reinforcement Learning object-detection | Unverified
Incorporating Pragmatic Reasoning Communication into Emergent Language | Jun 7, 2020 | Multi-agent Reinforcement Learning Reinforcement Learning (RL) | Unverified
Multi-Task Reinforcement Learning based Mobile Manipulation Control for Dynamic Object Tracking and Grasping | Jun 7, 2020 | Object Object Tracking | Unverified
Skill Discovery of Coordination in Multi-agent Reinforcement Learning | Jun 7, 2020 | Diversity Multi-agent Reinforcement Learning | Unverified
Real-Time Model Calibration with Deep Reinforcement Learning | Jun 7, 2020 | Deep Reinforcement Learning model | Unverified
Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity | Jun 6, 2020 | reinforcement-learning Reinforcement Learning (RL) | Unverified
Stable and Efficient Policy Evaluation | Jun 6, 2020 | Reinforcement Learning (RL) | Unverified
Efficient Evaluation of Natural Stochastic Policies in Offline Reinforcement Learning | Jun 6, 2020 | Off-policy evaluation reinforcement-learning | Unverified
Curiosity Killed or Incapacitated the Cat and the Asymptotically Optimal Agent | Jun 5, 2020 | reinforcement-learning Reinforcement Learning | Code Available