On Proximal Policy Optimization's Heavy-tailed Gradients Feb 20, 2021 continuous-control Continuous Control
— Unverified 0Model-Invariant State Abstractions for Model-Based Reinforcement Learning Feb 19, 2021 continuous-control Continuous Control
— Unverified 0On the Sample Complexity of Stability Constrained Imitation Learning Feb 18, 2021 continuous-control Continuous Control
— Unverified 0Learning Memory-Dependent Continuous Control from Demonstrations Feb 18, 2021 continuous-control Continuous Control
— Unverified 0Q-Value Weighted Regression: Reinforcement Learning with Limited Data Feb 12, 2021 Atari Games continuous-control
Code Code Available 0Robust Policy Gradient against Strong Data Corruption Feb 11, 2021 continuous-control Continuous Control
Code Code Available 0Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning Feb 10, 2021 continuous-control Continuous Control
— Unverified 0Measuring Progress in Deep Reinforcement Learning Sample Efficiency Feb 9, 2021 Atari Games continuous-control
— Unverified 0Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning Jan 23, 2021 continuous-control Continuous Control
— Unverified 0ES-ENAS: Efficient Evolutionary Optimization for Large Hybrid Search Spaces Jan 19, 2021 Combinatorial Optimization Continuous Control
Code Code Available 0Linear Representation Meta-Reinforcement Learning for Instant Adaptation Jan 12, 2021 continuous-control Continuous Control
— Unverified 0CoachNet: An Adversarial Sampling Approach for Reinforcement Learning Jan 7, 2021 continuous-control Continuous Control
— Unverified 0Markov Chain Monte Carlo Policy Optimization Jan 4, 2021 continuous-control Continuous Control
— Unverified 0Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity Jan 4, 2021 continuous-control Continuous Control
— Unverified 0Sample efficient Quality Diversity for neural continuous control Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Offline Policy Optimization with Variance Regularization Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Genetic Soft Updates for Policy Evolution in Deep Reinforcement Learning Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Self-Supervised Continuous Control without Policy Gradient Jan 1, 2021 continuous-control Continuous Control
— Unverified 0TEAC: Intergrating Trust Region and Max Entropy Actor Critic for Continuous Control Jan 1, 2021 continuous-control Continuous Control
Code Code Available 0Divide-and-Conquer Monte Carlo Tree Search Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Factored Action Spaces in Deep Reinforcement Learning Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Robust Offline Reinforcement Learning from Low-Quality Data Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Explicit Pareto Front Optimization for Constrained Reinforcement Learning Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Unbiased learning with State-Conditioned Rewards in Adversarial Imitation Learning Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Unsupervised Task Clustering for Multi-Task Reinforcement Learning Jan 1, 2021 Atari Games Clustering
Code Code Available 0Error Controlled Actor-Critic Method to Reinforcement Learning Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Learning Subgoal Representations with Slow Dynamics Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Deep Reinforcement Learning With Adaptive Combined Critics Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control Jan 1, 2021 continuous-control Continuous Control
Code Code Available 0What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study Jan 1, 2021 Attribute continuous-control
— Unverified 0Learning Efficient Planning-based Rewards for Imitation Learning Jan 1, 2021 Atari Games continuous-control
— Unverified 0Deep Coherent Exploration For Continuous Control Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Learning Latent Landmarks for Generalizable Planning Jan 1, 2021 continuous-control Continuous Control
— Unverified 0Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards Dec 26, 2020 continuous-control Continuous Control
Code Code Available 0Policy Optimization as Online Learning with Mediator Feedback Dec 15, 2020 continuous-control Continuous Control
— Unverified 0Policy Manifold Search for Improving Diversity-based Neuroevolution Dec 15, 2020 continuous-control Continuous Control
— Unverified 0OPAC: Opportunistic Actor-Critic Dec 11, 2020 continuous-control Continuous Control
— Unverified 0Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation Dec 9, 2020 continuous-control Continuous Control
— Unverified 0Proximal Policy Optimization Smoothed Algorithm Dec 4, 2020 continuous-control Continuous Control
— Unverified 0Reinforcement Learning for Control with Multiple Frequencies Dec 1, 2020 continuous-control Continuous Control
— Unverified 0On the Stability and Convergence of Robust Adversarial Reinforcement Learning: A Case Study on Linear Quadratic Systems Dec 1, 2020 continuous-control Continuous Control
— Unverified 0Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method Dec 1, 2020 continuous-control Continuous Control
— Unverified 0Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp Nov 30, 2020 continuous-control Continuous Control
Code Code Available 0Offline Learning from Demonstrations and Unlabeled Experience Nov 27, 2020 continuous-control Continuous Control
— Unverified 0Episodic Self-Imitation Learning with Hindsight Nov 26, 2020 continuous-control Continuous Control
Code Code Available 0C-Learning: Horizon-Aware Cumulative Accessibility Estimation Nov 24, 2020 continuous-control Continuous Control
Code Code Available 0Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games Nov 23, 2020 Continual Learning continuous-control
— Unverified 0Model-based Reinforcement Learning for Continuous Control with Posterior Sampling Nov 20, 2020 continuous-control Continuous Control
Code Code Available 0Nested Mixture of Experts: Cooperative and Competitive Learning of Hybrid Dynamical System Nov 20, 2020 continuous-control Continuous Control
— Unverified 0Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning Nov 13, 2020 continuous-control Continuous Control
— Unverified 0