| MO-Gym: A Library of Multi-Objective Reinforcement Learning Environments | Nov 30, 2022 | Multi-Objective Reinforcement Learning; OpenAI Gym | Code Available | 2 | 5 |
| A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning | Sep 26, 2023 | Benchmarking; Multi-Objective Reinforcement Learning | Code Available | 2 | 5 |
| Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning Approach | Sep 24, 2024 | Multi-Objective Reinforcement Learning; Reinforcement Learning (RL) | Code Available | 2 | 5 |
| A Review of the Deep Sea Treasure problem as a Multi-Objective Reinforcement Learning Benchmark | Oct 13, 2021 | Multi-Objective Reinforcement Learning | Code Available | 1 | 5 |
| GMOCAT: A Graph-Enhanced Multi-Objective Method for Computerized Adaptive Testing | Oct 11, 2023 | Diversity; Graph Neural Network | Code Available | 1 | 5 |
| Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging | Oct 17, 2023 | Language Modeling; Language Modelling | Code Available | 1 | 5 |
| Scalable Multi-Objective Reinforcement Learning with Fairness Guarantees using Lorenz Dominance | Nov 27, 2024 | Fairness; Multi-Objective Reinforcement Learning | Code Available | 1 | 5 |
| Multi-Objective Reinforcement Learning for Power Grid Topology Control | Jan 27, 2025 | Multi-Objective Reinforcement Learning; reinforcement-learning | Code Available | 1 | 5 |
| PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm | Aug 16, 2022 | continuous-control; Continuous Control | Code Available | 1 | 5 |
| Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences | Dec 14, 2023 | Multi-Objective Reinforcement Learning; Robot Navigation | Code Available | 1 | 5 |
| C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front | Oct 3, 2024 | continuous-control; Continuous Control | Code Available | 1 | 5 |
| UCB-driven Utility Function Search for Multi-objective Reinforcement Learning | May 1, 2024 | Decision Making; MuJoCo | Code Available | 1 | 5 |
| Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL | Aug 22, 2022 | Management; Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control | Jan 1, 2020 | Multi-Objective Reinforcement Learning; reinforcement-learning | Code Available | 1 | 5 |
| On Generalization Across Environments In Multi-Objective Reinforcement Learning | Mar 2, 2025 | Decision Making; Multi-Objective Reinforcement Learning | Code Available | 1 | 5 |
| Distributional Pareto-Optimal Multi-Objective Reinforcement Learning | Sep 21, 2023 | Autonomous Driving; Multi-Objective Reinforcement Learning | Code Available | 1 | 5 |
| Lexicographic Multi-Objective Reinforcement Learning | Dec 28, 2022 | Multi-Objective Reinforcement Learning; reinforcement-learning | Code Available | 1 | 5 |
| Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL | Apr 30, 2023 | Decision Making; MuJoCo | Code Available | 1 | 5 |
| Optimization of Molecules via Deep Reinforcement Learning | Oct 19, 2018 | Deep Reinforcement Learning; Molecular Graph Generation | Code Available | 1 | 5 |
| Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning | Jun 10, 2022 | Fairness; Multi-Objective Reinforcement Learning | Code Available | 0 | 5 |
| A Distributional View on Multi-Objective Policy Optimization | May 15, 2020 | Multi-Objective Reinforcement Learning; Reinforcement Learning (RL) | Code Available | 0 | 5 |
| Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning | Sep 30, 2024 | Decision Making; Multi-Objective Reinforcement Learning | Code Available | 0 | 5 |
| gTLO: A Generalized and Non-linear Multi-Objective Deep Reinforcement Learning Approach | Apr 11, 2022 | Deep Reinforcement Learning; Deep-Sea Treasure, Image version | Code Available | 0 | 5 |
| Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning | Nov 25, 2020 | Multi-Objective Reinforcement Learning; reinforcement-learning | Code Available | 0 | 5 |
| Hyperparameter Optimization for Multi-Objective Reinforcement Learning | Oct 25, 2023 | Hyperparameter Optimization; Multi-Objective Reinforcement Learning | Code Available | 0 | 5 |