SOTAVerified

Multi-Objective Reinforcement Learning

Papers

Showing 150 of 143 papers

TitleStatusHype
Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning ApproachCode2
A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement LearningCode2
MO-Gym: A Library of Multi-Objective Reinforcement Learning EnvironmentsCode2
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto FrontCode1
Scalable Multi-Objective Reinforcement Learning with Fairness Guarantees using Lorenz DominanceCode1
Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot ControlCode1
UCB-driven Utility Function Search for Multi-objective Reinforcement LearningCode1
A Review of the Deep Sea Treasure problem as a Multi-Objective Reinforcement Learning BenchmarkCode1
Distributional Pareto-Optimal Multi-Objective Reinforcement LearningCode1
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning AlgorithmCode1
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter MergingCode1
Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RLCode1
Multi-Objective Reinforcement Learning for Power Grid Topology ControlCode1
Formal Contracts Mitigate Social Dilemmas in Multi-Agent RLCode1
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human PreferencesCode1
GMOCAT: A Graph-Enhanced Multi-Objective Method for Computerized Adaptive TestingCode1
On Generalization Across Environments In Multi-Objective Reinforcement LearningCode1
Optimization of Molecules via Deep Reinforcement LearningCode1
Lexicographic Multi-Objective Reinforcement LearningCode1
Skill-based Multi-objective Reinforcement Learning of Industrial Robot Tasks with Planning and Knowledge IntegrationCode0
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement PrioritizationCode0
STEMO: Early Spatio-temporal Forecasting with Multi-Objective Reinforcement LearningCode0
Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-SnapshotsCode0
Dynamic Multi-Reward Weighting for Multi-Style Controllable GenerationCode0
Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement LearningCode0
Benchmarking MOEAs for solving continuous multi-objective RL problemsCode0
Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learningCode0
PSMGD: Periodic Stochastic Multi-Gradient Descent for Fast Multi-Objective OptimizationCode0
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement LearningCode0
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement LearningCode0
Exploring the Impact of Tunable Agents in Sequential Social DilemmasCode0
A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy AdaptationCode0
Piecewise-Stationary Multi-Objective Multi-Armed Bandit with Application to Joint Communications and SensingCode0
Multi-objective Reinforcement learning from AI FeedbackCode0
A Practical Guide to Multi-Objective Reinforcement Learning and PlanningCode0
Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement LearningCode0
PMGDA: A Preference-based Multiple Gradient Descent AlgorithmCode0
Divide and Conquer: Provably Unveiling the Pareto Front with Multi-Objective Reinforcement LearningCode0
AdsorbRL: Deep Multi-Objective Reinforcement Learning for Inverse Catalysts DesignCode0
Dynamic Weights in Multi-Objective Deep Reinforcement LearningCode0
Learning Pareto Set for Multi-Objective Continuous Robot ControlCode0
EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-TuningCode0
Mol-MoE: Training Preference-Guided Routers for Molecule GenerationCode0
Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance OptimizationCode0
gTLO: A Generalized and Non-linear Multi-Objective Deep Reinforcement Learning ApproachCode0
Active Sampling for MRI-based Sequential Decision MakingCode0
Collision Avoidance Robotics Via Meta-Learning (CARML)Code0
Inferring Preferences from Demonstrations in Multi-objective Reinforcement LearningCode0
Multi-Objective Deep Reinforcement LearningCode0
Policy-regularized Offline Multi-objective Reinforcement LearningCode0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.