SOTAVerified

Multi-agent Reinforcement Learning

The target of Multi-agent Reinforcement Learning is to solve complex problems by integrating multiple agents that focus on different sub-tasks. In general, there are two types of multi-agent systems: independent and cooperative systems.

Source: Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports

Papers

Showing 150 of 1718 papers

TitleStatusHype
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion PlanningCode4
Unreal-MAP: Unreal-Engine-Based General Platform for Multi-Agent Reinforcement LearningCode3
Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and EvaluationCode3
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement LearningCode3
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning LibraryCode3
SustainDC: Benchmarking for Sustainable Data Center ControlCode2
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement LearningCode2
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-MakingCode2
Multi-Agent Reinforcement Learning is a Sequence Modeling ProblemCode2
ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot CoordinationCode2
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social DilemmasCode2
Tactics2D: A Highly Modular and Extensible Simulator for Driving Decision-makingCode2
PettingZoo: Gym for Multi-Agent Reinforcement LearningCode2
Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement LearningCode2
Off-the-Grid MARL: Datasets with Baselines for Offline Multi-Agent Reinforcement LearningCode2
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning BenchmarksCode2
Heterogeneous-Agent Reinforcement LearningCode2
MAexp: A Generic Platform for RL-based Multi-Agent ExplorationCode2
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement LearningCode2
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement LearningCode2
Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A SurveyCode2
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement LearningCode2
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous DrivingCode2
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement LearningCode2
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous DrivingCode2
Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement LearningCode2
Digital Twin Vehicular Edge Computing Network: Task Offloading and Resource AllocationCode2
VMAS: A Vectorized Multi-Agent Simulator for Collective Robot LearningCode2
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement LearningCode2
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement LearningCode2
Deep Reinforcement Learning for Multi-Agent InteractionCode2
Coordinate-Aligned Multi-Camera Collaboration for Active Multi-Object TrackingCode2
Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk ManagementCode2
Emergent Reciprocity and Team Formation from Randomized Uncertain Social PreferencesCode2
Ensembling Prioritized Hybrid Policies for Multi-agent PathfindingCode2
Heterogeneous Multi-Robot Reinforcement LearningCode2
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language ModelsCode2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAXCode2
Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter ControlCode2
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement LearningCode2
Maximum Entropy Heterogeneous-Agent Reinforcement LearningCode2
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-DependencyCode2
MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement LearningCode2
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement LearningCode2
A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online AdvertisingCode1
A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal ControlCode1
Cooperation and Fairness in Multi-Agent Reinforcement LearningCode1
Contrastive Identity-Aware Learning for Multi-Agent Value DecompositionCode1
Context-aware Communication for Multi-agent Reinforcement LearningCode1
Show:102550
← PrevPage 1 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MATD3final agent reward-14Unverified
#ModelMetricClaimedVerifiedStatus
1DRIMAMedian Win Rate15Unverified
#ModelMetricClaimedVerifiedStatus
1Fusion-Multi-Actor-Attention-CriticAverage Reward39Unverified