| Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL | Apr 30, 2023 | Decision MakingMuJoCo | CodeCode Available | 1 |
| When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning | Feb 15, 2023 | Autonomous Drivingcontinuous-control | CodeCode Available | 1 |
| Order Matters: Agent-by-agent Policy Optimization | Feb 13, 2023 | MuJoCo | CodeCode Available | 1 |
| Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning | Feb 13, 2023 | Imitation LearningMuJoCo | CodeCode Available | 1 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |
| AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners | Feb 3, 2023 | DiversityMuJoCo | CodeCode Available | 1 |
| Joint action loss for proximal policy optimization | Jan 26, 2023 | Dota 2MuJoCo | CodeCode Available | 1 |
| Partial advantage estimator for proximal policy optimization | Jan 26, 2023 | MuJoCoPolicy Gradient Methods | CodeCode Available | 1 |
| Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification | Nov 7, 2022 | MuJoCo | CodeCode Available | 1 |
| WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments | Oct 14, 2022 | Atari GamesBenchmarking | CodeCode Available | 1 |