Policy Optimization for H_2 Linear Control with H_∞ Robustness Guarantee: Implicit Regularization and Global Convergence | Jun 8, 2020 | Reinforcement Learning (RL) | Unverified 0
Tools for Data-driven Modeling of Within-Hand Manipulation with Underactuated Adaptive Hands | Jun 8, 2020 | Model-based Reinforcement Learning, Reinforcement Learning (RL) | Code Available 0
Online Data Poisoning Attacks | Jun 8, 2020 | Data Poisoning, Deep Reinforcement Learning | Unverified 0
Scalable Reinforcement Learning of Localized Policies for Multi-Agent Networked Systems | Jun 8, 2020 | reinforcement-learning, Reinforcement Learning (RL) | Unverified 0
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors | Jun 8, 2020 | model, Model-based Reinforcement Learning | Unverified 0
Balancing a CartPole System with Reinforcement Learning -- A Tutorial | Jun 8, 2020 | OpenAI Gym, Q-Learning | Unverified 0
Conservative Q-Learning for Offline Reinforcement Learning | Jun 8, 2020 | continuous-control, Continuous Control | Code Available 1
A Comparison of Self-Play Algorithms Under a Generalized Framework | Jun 8, 2020 | Reinforcement Learning (RL) | Unverified 0
A Decentralized Policy Gradient Approach to Multi-task Reinforcement Learning | Jun 8, 2020 | Atari Games, Multi-Task Learning | Unverified 0
Hallucinating Value: A Pitfall of Dyna-style Planning with Imperfect Environment Models | Jun 8, 2020 | Reinforcement Learning (RL) | Unverified 0
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret | Jun 8, 2020 | Q-Learning, reinforcement-learning | Unverified 0
Learning to Play No-Press Diplomacy with Best Response Policy Iteration | Jun 8, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available 1
Stable Reinforcement Learning with Unbounded State Space | Jun 8, 2020 | reinforcement-learning, Reinforcement Learning | Unverified 0
Reinforcement Learning Under Moral Uncertainty | Jun 8, 2020 | Autonomous Vehicles, BIG-bench Machine Learning | Code Available 1
Randomized Policy Learning for Continuous State and Action MDPs | Jun 8, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified 0
Skill Discovery of Coordination in Multi-agent Reinforcement Learning | Jun 7, 2020 | Diversity, Multi-agent Reinforcement Learning | Unverified 0
Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains | Jun 7, 2020 | Decision Making, Hierarchical Reinforcement Learning | Code Available 1
Real-Time Model Calibration with Deep Reinforcement Learning | Jun 7, 2020 | Deep Reinforcement Learning, model | Unverified 0
Multi-Task Reinforcement Learning based Mobile Manipulation Control for Dynamic Object Tracking and Grasping | Jun 7, 2020 | Object, Object Tracking | Unverified 0
Efficient Poverty Mapping using Deep Reinforcement Learning | Jun 7, 2020 | Deep Reinforcement Learning, object-detection | Unverified 0
Dual Policy Distillation | Jun 7, 2020 | continuous-control, Continuous Control | Code Available 0
Implications of Human Irrationality for Reinforcement Learning | Jun 7, 2020 | BIG-bench Machine Learning, Decision Making | Unverified 0
Incorporating Pragmatic Reasoning Communication into Emergent Language | Jun 7, 2020 | Multi-agent Reinforcement Learning, Reinforcement Learning (RL) | Unverified 0
Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning | Jun 7, 2020 | counterfactual, Multi-agent Reinforcement Learning | Code Available 1
Efficient Evaluation of Natural Stochastic Policies in Offline Reinforcement Learning | Jun 6, 2020 | Off-policy evaluation, reinforcement-learning | Unverified 0
Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity | Jun 6, 2020 | reinforcement-learning, Reinforcement Learning (RL) | Unverified 0
Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity | Jun 6, 2020 | reinforcement-learning, Reinforcement Learning (RL) | Code Available 1
Stable and Efficient Policy Evaluation | Jun 6, 2020 | Reinforcement Learning (RL) | Unverified 0
State Action Separable Reinforcement Learning | Jun 5, 2020 | Decision Making, reinforcement-learning | Unverified 0
AutoHAS: Efficient Hyperparameter and Architecture Search | Jun 5, 2020 | AutoML, Hyperparameter Optimization | Unverified 0
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Jun 5, 2020 | Offline RL, reinforcement-learning | Code Available 1
Curiosity Killed or Incapacitated the Cat and the Asymptotically Optimal Agent | Jun 5, 2020 | reinforcement-learning, Reinforcement Learning | Code Available 0
Balancing Reinforcement Learning Training Experiences in Interactive Information Retrieval | Jun 5, 2020 | Information Retrieval, reinforcement-learning | Unverified 0
Visual Transfer for Reinforcement Learning via Wasserstein Domain Confusion | Jun 4, 2020 | reinforcement-learning, Reinforcement Learning | Code Available 0
Meta-Model-Based Meta-Policy Optimization | Jun 4, 2020 | continuous-control, Continuous Control | Unverified 0
Single-step deep reinforcement learning for open-loop control of laminar and turbulent flows | Jun 4, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available 1
Solving Hard AI Planning Instances Using Curriculum-Driven Deep Reinforcement Learning | Jun 4, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available 1
Refined Continuous Control of DDPG Actors via Parametrised Activation | Jun 4, 2020 | continuous-control, Continuous Control | Unverified 0
Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty | Jun 4, 2020 | reinforcement-learning, Reinforcement Learning | Unverified 0
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines | Jun 4, 2020 | Q-Learning, reinforcement-learning | Code Available 0
Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains | Jun 3, 2020 | Autonomous Driving, Causal Inference | Unverified 0
Interferobot: aligning an optical interferometer by a reinforcement learning agent | Jun 3, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available 1
Learning to Scan: A Deep Reinforcement Learning Approach for Personalized Scanning in CT Imaging | Jun 3, 2020 | compressed sensing, Computed Tomography (CT) | Unverified 0
The Value-Improvement Path: Towards Better Representations for Reinforcement Learning | Jun 3, 2020 | Atari Games, reinforcement-learning | Unverified 0
Temporally-Extended ε-Greedy Exploration | Jun 2, 2020 | Reinforcement Learning (RL) | Code Available 0
Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent | Jun 2, 2020 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available 0
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration | Jun 2, 2020 | Diversity, Efficient Exploration | Code Available 0
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization | Jun 2, 2020 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available 1
Active Vision for Early Recognition of Human Actions | Jun 1, 2020 | reinforcement-learning, Reinforcement Learning (RL) | Unverified 0
Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning | Jun 1, 2020 | Face Recognition, Fairness | Unverified 0