| Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization | Apr 28, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Reinforcement Learning using Guided Observability | Apr 22, 2021 | Decision MakingMuJoCo | —Unverified | 0 |
| Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning | Apr 19, 2021 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 |
| Reward function shape exploration in adversarial imitation learning: an empirical study | Apr 14, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning What To Do by Simulating the Past | Apr 8, 2021 | MuJoCo | CodeCode Available | 0 |
| No Need for Interactions: Robust Model-Based Imitation Learning using Neural ODE | Apr 3, 2021 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| Hamiltonian Policy Optimization in Reinforcement Learning | Mar 23, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method | Mar 22, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Bayesian Distributional Policy Gradients | Mar 20, 2021 | Atari GamesContrastive Learning | —Unverified | 0 |
| A Quadratic Actor Network for Model-Free Reinforcement Learning | Mar 11, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning | Mar 10, 2021 | Contrastive LearningMeta Reinforcement Learning | —Unverified | 0 |
| Hamiltonian Policy Optimization | Feb 28, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Action Redundancy in Reinforcement Learning | Feb 22, 2021 | MuJoCoreinforcement-learning | —Unverified | 0 |
| On Proximal Policy Optimization's Heavy-tailed Gradients | Feb 20, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Model-Invariant State Abstractions for Model-Based Reinforcement Learning | Feb 19, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels | Feb 19, 2021 | MuJoCo | —Unverified | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |
| Robust Policy Gradient against Strong Data Corruption | Feb 11, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Variance Penalized On-Policy and Off-Policy Actor-Critic | Feb 3, 2021 | MuJoCo | CodeCode Available | 0 |
| GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning | Jan 24, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Hellinger Distance Constrained Regression | Jan 1, 2021 | MuJoCoregression | —Unverified | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 |
| Self-Supervised Continuous Control without Policy Gradient | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| MQES: Max-Q Entropy Search for Efficient Exploration in Continuous Reinforcement Learning | Jan 1, 2021 | Efficient ExplorationMuJoCo | —Unverified | 0 |
| Formal Language Constrained Markov Decision Processes | Jan 1, 2021 | MuJoCo | —Unverified | 0 |