SOTAVerified

Multi-agent Reinforcement Learning

The target of Multi-agent Reinforcement Learning is to solve complex problems by integrating multiple agents that focus on different sub-tasks. In general, there are two types of multi-agent systems: independent and cooperative systems.

Source: Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports

Papers

Showing 851900 of 1718 papers

TitleStatusHype
Offline Pre-trained Multi-Agent Decision Transformer0
Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration0
Off-Policy Action Anticipation in Multi-Agent Reinforcement Learning0
Offsetting Unequal Competition through RL-assisted Incentive Schemes0
On Diagnostics for Understanding Agent Training Behaviour in Cooperative MARL0
Data Poisoning to Fake a Nash Equilibrium in Markov Games0
On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality0
Online and Bandit Algorithms for Nonstationary Stochastic Saddle-Point Optimization0
Online Location Planning for AI-Defined Vehicles: Optimizing Joint Tasks of Order Serving and Spatio-Temporal Heterogeneous Model Fine-Tuning0
Online Multi-agent Reinforcement Learning for Decentralized Inverter-based Volt-VAR Control0
Online Tuning for Offline Decentralized Multi-Agent Reinforcement Learning0
On Memory Mechanism in Multi-Agent Reinforcement Learning0
On Solving Cooperative MARL Problems with a Few Good Experiences0
On Stateful Value Factorization in Multi-Agent Reinforcement Learning0
On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)0
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games0
On the Complexity of Multi-Agent Decision Making: From Learning in Games to Partial Monitoring0
On the Convergence of Consensus Algorithms with Markovian Noise and Gradient Bias0
On Gradient-Based Learning in Continuous Games0
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination0
On the Hardness of Decentralized Multi-Agent Policy Evaluation under Byzantine Attacks0
On the Near-Optimality of Local Policies in Large Cooperative Multi-Agent Reinforcement Learning0
On the Role of Emergent Communication for Social Learning in Multi-Agent Reinforcement Learning0
Ontology-driven Reinforcement Learning for Personalized Student Support0
Optimal Lattice Boltzmann Closures through Multi-Agent Reinforcement Learning0
Optimal Path Planning and Cost Minimization for a Drone Delivery System Via Model Predictive Control0
Optimising Energy Efficiency in UAV-Assisted Networks using Deep Reinforcement Learning0
Optimistic ε-Greedy Exploration for Cooperative Multi-Agent Reinforcement Learning0
Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents0
Optimization of Image Transmission in a Cooperative Semantic Communication Networks0
Optimizing Market Making using Multi-Agent Reinforcement Learning0
Options as responses: Grounding behavioural hierarchies in multi-agent RL0
OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning0
Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning0
OrbitZoo: Multi-Agent Reinforcement Learning Environment for Orbital Dynamics0
Order book regulatory impact on stock market quality: a multi-agent reinforcement learning perspective0
“Other-Play” for Zero-Shot Coordination0
PAC Guarantees for Cooperative Multi-Agent Reinforcement Learning with Restricted Communication0
Packet Routing with Graph Attention Multi-agent Reinforcement Learning0
PAC Reinforcement Learning Algorithm for General-Sum Markov Games0
Parallel Knowledge Transfer in Multi-Agent Reinforcement Learning0
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning0
Parameter Sharing with Network Pruning for Scalable Multi-Agent Deep Reinforcement Learning0
Partially Observable Multi-Agent Reinforcement Learning with Information Sharing0
PathSeeker: Exploring LLM Security Vulnerabilities with a Reinforcement Learning-Based Jailbreak Approach0
Paths to Equilibrium in Games0
PEnGUiN: Partially Equivariant Graph NeUral Networks for Sample Efficient MARL0
Sable: a Performant, Efficient and Scalable Sequence Model for MARL0
Perimeter Control with Heterogeneous Metering Rates for Cordon Signals: A Physics-Regularized Multi-Agent Reinforcement Learning Approach0
Permutation Invariant Policy Optimization for Mean-Field Multi-Agent Reinforcement Learning: A Principled Approach0
Show:102550
← PrevPage 18 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MATD3final agent reward-14Unverified
#ModelMetricClaimedVerifiedStatus
1DRIMAMedian Win Rate15Unverified
#ModelMetricClaimedVerifiedStatus
1Fusion-Multi-Actor-Attention-CriticAverage Reward39Unverified