| Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL | Apr 30, 2023 | Decision Making, MuJoCo | Code Available | 1 |
| When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning | Feb 15, 2023 | Autonomous Driving, continuous-control | Code Available | 1 |
| Order Matters: Agent-by-agent Policy Optimization | Feb 13, 2023 | MuJoCo | Code Available | 1 |
| Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning | Feb 13, 2023 | Imitation Learning, MuJoCo | Code Available | 1 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous Control, MuJoCo | Code Available | 1 |
| AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners | Feb 3, 2023 | Diversity, MuJoCo | Code Available | 1 |
| Partial advantage estimator for proximal policy optimization | Jan 26, 2023 | MuJoCo, Policy Gradient Methods | Code Available | 1 |
| Joint action loss for proximal policy optimization | Jan 26, 2023 | Dota 2, MuJoCo | Code Available | 1 |
| Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification | Nov 7, 2022 | MuJoCo | Code Available | 1 |
| WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments | Oct 14, 2022 | Atari Games, Benchmarking | Code Available | 1 |
| Monte Carlo Tree Search based Variable Selection for High Dimensional Bayesian Optimization | Oct 4, 2022 | Bayesian Optimization, MuJoCo | Code Available | 1 |
| Short-Term Plasticity Neurons Learning to Learn and Forget | Jun 28, 2022 | MuJoCo, Reinforcement Learning (RL) | Code Available | 1 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk | Jun 9, 2022 | continuous-control, Continuous Control | Code Available | 1 |
| ARLO: A Framework for Automated Reinforcement Learning | May 20, 2022 | feature selection, MuJoCo | Code Available | 1 |
| Value Gradient weighted Model-Based Reinforcement Learning | Apr 4, 2022 | model, Model-based Reinforcement Learning | Code Available | 1 |
| Deconstructing the Inductive Biases of Hamiltonian Neural Networks | Feb 10, 2022 | MuJoCo | Code Available | 1 |
| Lipschitz-constrained Unsupervised Skill Discovery | Feb 2, 2022 | Diversity, MuJoCo | Code Available | 1 |
| SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning | Dec 31, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion | Dec 11, 2021 | MuJoCo, reinforcement-learning | Code Available | 1 |
| Residual Pathway Priors for Soft Equivariance Constraints | Dec 2, 2021 | MuJoCo | Code Available | 1 |
| Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | Dec 1, 2021 | Domain Adaptation, MuJoCo | Code Available | 1 |
| Offline Model-based Adaptable Policy Learning | Dec 1, 2021 | Decision Making, model | Code Available | 1 |
| EDGE: Explaining Deep Reinforcement Learning Policies | Dec 1, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Generalized Decision Transformer for Offline Hindsight Information Matching | Nov 19, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| Robust Deep Reinforcement Learning for Quadcopter Control | Nov 6, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Conditioning Sparse Variational Gaussian Processes for Online Decision-making | Oct 28, 2021 | Active Learning, Decision Making | Code Available | 1 |
| Multi-Agent Constrained Policy Optimisation | Oct 6, 2021 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 1 |
| Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning | Sep 23, 2021 | LEMMA, MuJoCo | Code Available | 1 |
| Settling the Variance of Multi-Agent Policy Gradients | Aug 19, 2021 | MuJoCo, Reinforcement Learning (RL) | Code Available | 1 |
| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RL, Distributional Reinforcement Learning | Code Available | 1 |
| Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning | Jul 6, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Unsupervised Skill Discovery with Bottleneck Option Learning | Jun 27, 2021 | Disentanglement, MuJoCo | Code Available | 1 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk | Jun 18, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| A Game-Theoretic Approach to Multi-Agent Trust Region Optimization | Jun 12, 2021 | Atari Games, MuJoCo | Code Available | 1 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL | Jun 9, 2021 | MuJoCo, Reinforcement Learning (RL) | Code Available | 1 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage | Jun 6, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture | May 28, 2021 | Meta Reinforcement Learning, MuJoCo | Code Available | 1 |
| Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage | May 21, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet | May 12, 2021 | MuJoCo, Multi-Goal Reinforcement Learning | Code Available | 1 |
| Generalizable Episodic Memory for Deep Reinforcement Learning | Mar 11, 2021 | Atari Games, continuous-control | Code Available | 1 |
| Model-free Policy Learning with Reward Gradients | Mar 9, 2021 | Continuous Control, model | Code Available | 1 |
| Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments | Jan 20, 2021 | MuJoCo | Code Available | 1 |
| Randomized Ensembled Double Q-Learning: Learning Fast Without a Model | Jan 15, 2021 | MuJoCo, Q-Learning | Code Available | 1 |
| Cross-Modal Domain Adaptation for Reinforcement Learning | Jan 1, 2021 | Domain Adaptation, MuJoCo | Code Available | 1 |
| Multi-Agent Trust Region Learning | Jan 1, 2021 | Atari Games, MuJoCo | Code Available | 1 |
| Reset-Free Lifelong Learning with Skill-Space Planning | Dec 7, 2020 | Lifelong learning, MuJoCo | Code Available | 1 |
| RealAnt: An Open-Source Low-Cost Quadruped for Education and Research in Real-World Reinforcement Learning | Nov 5, 2020 | MuJoCo, reinforcement-learning | Code Available | 1 |
| Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control | Oct 15, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| Reinforcement Learning with Random Delays | Oct 6, 2020 | Anatomy, continuous-control | Code Available | 1 |