| Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL | Apr 30, 2023 | Decision MakingMuJoCo | CodeCode Available | 1 |
| When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning | Feb 15, 2023 | Autonomous Drivingcontinuous-control | CodeCode Available | 1 |
| Order Matters: Agent-by-agent Policy Optimization | Feb 13, 2023 | MuJoCo | CodeCode Available | 1 |
| Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning | Feb 13, 2023 | Imitation LearningMuJoCo | CodeCode Available | 1 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |
| AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners | Feb 3, 2023 | DiversityMuJoCo | CodeCode Available | 1 |
| Partial advantage estimator for proximal policy optimization | Jan 26, 2023 | MuJoCoPolicy Gradient Methods | CodeCode Available | 1 |
| Joint action loss for proximal policy optimization | Jan 26, 2023 | Dota 2MuJoCo | CodeCode Available | 1 |
| Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification | Nov 7, 2022 | MuJoCo | CodeCode Available | 1 |
| WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments | Oct 14, 2022 | Atari GamesBenchmarking | CodeCode Available | 1 |
| Monte Carlo Tree Search based Variable Selection for High Dimensional Bayesian Optimization | Oct 4, 2022 | Bayesian OptimizationMuJoCo | CodeCode Available | 1 |
| Short-Term Plasticity Neurons Learning to Learn and Forget | Jun 28, 2022 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 1 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk | Jun 9, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| ARLO: A Framework for Automated Reinforcement Learning | May 20, 2022 | feature selectionMuJoCo | CodeCode Available | 1 |
| Value Gradient weighted Model-Based Reinforcement Learning | Apr 4, 2022 | modelModel-based Reinforcement Learning | CodeCode Available | 1 |
| Deconstructing the Inductive Biases of Hamiltonian Neural Networks | Feb 10, 2022 | MuJoCo | CodeCode Available | 1 |
| Lipschitz-constrained Unsupervised Skill Discovery | Feb 2, 2022 | DiversityMuJoCo | CodeCode Available | 1 |
| SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning | Dec 31, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion | Dec 11, 2021 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| Residual Pathway Priors for Soft Equivariance Constraints | Dec 2, 2021 | MuJoCo | CodeCode Available | 1 |
| Offline Model-based Adaptable Policy Learning | Dec 1, 2021 | Decision Makingmodel | CodeCode Available | 1 |
| Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | Dec 1, 2021 | Domain AdaptationMuJoCo | CodeCode Available | 1 |
| EDGE: Explaining Deep Reinforcement Learning Policies | Dec 1, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Generalized Decision Transformer for Offline Hindsight Information Matching | Nov 19, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Robust Deep Reinforcement Learning for Quadcopter Control | Nov 6, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |