SOTAVerified

Multi-agent Reinforcement Learning

The target of Multi-agent Reinforcement Learning is to solve complex problems by integrating multiple agents that focus on different sub-tasks. In general, there are two types of multi-agent systems: independent and cooperative systems.

Source: Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports

Papers

Showing 326350 of 1718 papers

TitleStatusHype
MolOpt: Autonomous Molecular Geometry Optimization using Multi-Agent Reinforcement LearningCode0
RGMComm: Return Gap Minimization via Discrete Communications in Multi-Agent Reinforcement LearningCode0
What is Local Optimality in Nonconvex-Nonconcave Minimax Optimization?Code0
MDPGT: Momentum-based Decentralized Policy Gradient TrackingCode0
Mean-Field Control based Approximation of Multi-Agent Reinforcement Learning in Presence of a Non-decomposable Shared Global StateCode0
A Deep Multi-Agent Reinforcement Learning Approach to Autonomous Separation AssuranceCode0
Biological Pathway Guided Gene Selection Through Collaborative Reinforcement LearningCode0
MAVEN: Multi-Agent Variational ExplorationCode0
Measuring Policy Distance for Multi-Agent Reinforcement LearningCode0
MAC-PO: Multi-Agent Experience Replay via Collective Priority OptimizationCode0
M^3RL: Mind-aware Multi-agent Management Reinforcement LearningCode0
MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective IntelligenceCode0
Light Aircraft Game : Basic Implementation and training results analysisCode0
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement LearningCode0
Logic-based Reward Shaping for Multi-Agent Reinforcement LearningCode0
MAHTM: A Multi-Agent Framework for Hierarchical Transactive MicrogridsCode0
Mediated Multi-Agent Reinforcement LearningCode0
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive ReasoningCode0
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided MarketsCode0
Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel DecodingCode0
Learning to Gather without CommunicationCode0
Learning to Schedule Communication in Multi-agent Reinforcement LearningCode0
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive EnvironmentsCode0
Balancing Performance and Cost for Two-Hop Cooperative Communications: Stackelberg Game and Distributed Multi-Agent Reinforcement LearningCode0
Learning to Share and Hide Intentions using Information RegularizationCode0
Show:102550
← PrevPage 14 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MATD3final agent reward-14Unverified
#ModelMetricClaimedVerifiedStatus
1DRIMAMedian Win Rate15Unverified
#ModelMetricClaimedVerifiedStatus
1Fusion-Multi-Actor-Attention-CriticAverage Reward39Unverified