| Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration | Sep 27, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience | Sep 24, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients | Sep 24, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning | Sep 23, 2021 | LEMMAMuJoCo | CodeCode Available | 1 |
| Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning | Sep 8, 2021 | Adversarial Attackcontinuous-control | —Unverified | 0 |
| Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning | Sep 6, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization | Aug 19, 2021 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Settling the Variance of Multi-Agent Policy Gradients | Aug 19, 2021 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 1 |
| A general class of surrogate functions for stable and efficient reinforcement learning | Aug 12, 2021 | MuJoCoPolicy Gradient Methods | CodeCode Available | 0 |
| A Pragmatic Look at Deep Imitation Learning | Aug 4, 2021 | Behavioural cloningD4RL | CodeCode Available | 0 |
| Tianshou: a Highly Modularized Deep Reinforcement Learning Library | Jul 29, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 3 |
| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RLDistributional Reinforcement Learning | CodeCode Available | 1 |
| Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning | Jul 6, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| On the Benefits of Inducing Local Lipschitzness for Robust Generative Adversarial Imitation Learning | Jun 30, 2021 | Imitation LearningMuJoCo | —Unverified | 0 |
| Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning | Jun 30, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Unsupervised Skill Discovery with Bottleneck Option Learning | Jun 27, 2021 | DisentanglementMuJoCo | CodeCode Available | 1 |
| Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation | Jun 24, 2021 | MuJoCoOpenAI Gym | CodeCode Available | 2 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk | Jun 18, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| SparseDice: Imitation Learning for Temporally Sparse Data via Regularization | Jun 13, 2021 | Imitation LearningMuJoCo | —Unverified | 0 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| A Game-Theoretic Approach to Multi-Agent Trust Region Optimization | Jun 12, 2021 | Atari GamesMuJoCo | CodeCode Available | 1 |
| Keyframe-Focused Visual Imitation Learning | Jun 11, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL | Jun 9, 2021 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 1 |
| Average-Reward Reinforcement Learning with Trust Region Methods | Jun 7, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| DisTop: Discovering a Topological representation to learn diverse and rewarding skills | Jun 6, 2021 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage | Jun 6, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching | Jun 6, 2021 | Imitation LearningMuJoCo | —Unverified | 0 |
| Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture | May 28, 2021 | Meta Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage | May 21, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Regret Minimization Experience Replay in Off-Policy Reinforcement Learning | May 15, 2021 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL | May 14, 2021 | Inductive BiasMeta Reinforcement Learning | —Unverified | 0 |
| An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet | May 12, 2021 | MuJoCoMulti-Goal Reinforcement Learning | CodeCode Available | 1 |
| Context-Based Soft Actor Critic for Environments with Non-stationary Dynamics | May 7, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization | Apr 28, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Reinforcement Learning using Guided Observability | Apr 22, 2021 | Decision MakingMuJoCo | —Unverified | 0 |
| Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning | Apr 19, 2021 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 |
| Reward function shape exploration in adversarial imitation learning: an empirical study | Apr 14, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning What To Do by Simulating the Past | Apr 8, 2021 | MuJoCo | CodeCode Available | 0 |
| No Need for Interactions: Robust Model-Based Imitation Learning using Neural ODE | Apr 3, 2021 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| Hamiltonian Policy Optimization in Reinforcement Learning | Mar 23, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method | Mar 22, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Bayesian Distributional Policy Gradients | Mar 20, 2021 | Atari GamesContrastive Learning | —Unverified | 0 |
| Generalizable Episodic Memory for Deep Reinforcement Learning | Mar 11, 2021 | Atari Gamescontinuous-control | CodeCode Available | 1 |
| A Quadratic Actor Network for Model-Free Reinforcement Learning | Mar 11, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning | Mar 10, 2021 | Contrastive LearningMeta Reinforcement Learning | —Unverified | 0 |
| Model-free Policy Learning with Reward Gradients | Mar 9, 2021 | Continuous Controlmodel | CodeCode Available | 1 |
| Hamiltonian Policy Optimization | Feb 28, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Action Redundancy in Reinforcement Learning | Feb 22, 2021 | MuJoCoreinforcement-learning | —Unverified | 0 |
| On Proximal Policy Optimization's Heavy-tailed Gradients | Feb 20, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Model-Invariant State Abstractions for Model-Based Reinforcement Learning | Feb 19, 2021 | continuous-controlContinuous Control | —Unverified | 0 |