SOTAVerified

Multi-agent Reinforcement Learning

The goal of multi-agent reinforcement learning is to solve complex problems by combining multiple agents, each focused on a different sub-task. Multi-agent systems generally fall into two types: independent and cooperative.

Source: Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports
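
The difference between independent and cooperative systems can be illustrated with a minimal sketch. It assumes two stateless, epsilon-greedy Q-learners playing a repeated two-action game. The function names, reward definitions, and hyperparameters are hypothetical choices made for illustration and do not come from the source paper. In the independent setting each agent is rewarded only for its own sub-task. In the cooperative setting both agents receive the same team reward.

```python
import random


def independent_rewards(a0, a1):
    # Assumed setup: each agent has its own sub-task and its reward
    # ignores what the other agent does.
    return (1.0 if a0 == 0 else 0.0, 1.0 if a1 == 1 else 0.0)


def cooperative_rewards(a0, a1):
    # Assumed setup: a shared team reward, paid only when the agents coordinate.
    team = 1.0 if a0 == a1 else 0.0
    return (team, team)


def greedy(q_row):
    # Index of the highest-valued action (ties resolve to the lowest index).
    return max(range(len(q_row)), key=q_row.__getitem__)


def train(reward_fn, episodes=2000, alpha=0.1, epsilon=0.1, seed=0):
    """Train two stateless Q-learners on a repeated 2-action game."""
    rng = random.Random(seed)
    q = [[0.0, 0.0], [0.0, 0.0]]  # q[agent][action]
    for _ in range(episodes):
        actions = []
        for agent in range(2):
            if rng.random() < epsilon:
                actions.append(rng.randrange(2))  # explore
            else:
                actions.append(greedy(q[agent]))  # exploit
        rewards = reward_fn(actions[0], actions[1])
        for agent in range(2):
            a = actions[agent]
            # Move the estimate toward the observed reward.
            q[agent][a] += alpha * (rewards[agent] - q[agent][a])
    return q


q_ind = train(independent_rewards)
q_coop = train(cooperative_rewards)
print("independent greedy actions:", [greedy(row) for row in q_ind])
print("cooperative greedy actions:", [greedy(row) for row in q_coop])
```

In this toy setup the independent learners each settle on the action their own sub-task rewards, while the cooperative learners settle on a matching pair of actions because only coordination is rewarded.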

Papers

Showing 1676-1700 of 1718 papers

Title | Status | Hype
Learning Existing Social Conventions via Observationally Augmented Self-Play |  | 0
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization |  | 0
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients |  | 0
On Gradient-Based Learning in Continuous Games |  | 0
Emergent Communication through Negotiation |  | 0
Towards Learning Transferable Conversational Skills using Multi-dimensional Dialogue Modelling | Code | 0
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Code | 1
Entropy based Independent Learning in Anonymous Multi-Agent Settings |  | 0
Inequity aversion improves cooperation in intertemporal social dilemmas | Code | 1
Valuing knowledge, information and agency in Multi-agent Reinforcement Learning: a case study in smart buildings |  | 0
Intent-aware Multi-agent Reinforcement Learning |  | 0
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising |  | 0
Modeling Others using Oneself in Multi-Agent Reinforcement Learning |  | 0
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents | Code | 1
Asynchronous stochastic approximations with asymptotically biased errors and deep multi-agent learning |  | 0
Learning to Gather without Communication | Code | 0
Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Code | 0
Mean Field Multi-Agent Reinforcement Learning | Code | 1
Cooperative Multi-Agent Reinforcement Learning for Low-Level Wireless Communication |  | 0
Neuron as an Agent |  | 0
MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence | Code | 0
Routing Networks: Adaptive Selection of Non-linear Functions for Multi-Task Learning |  | 0
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning |  | 0
Learning with Opponent-Learning Awareness | Code | 0
Prosocial learning agents solve generalized Stag Hunts better than selfish ones | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MATD3 | final agent reward | -14 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DRIMA | Median Win Rate | 15 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fusion-Multi-Actor-Attention-Critic | Average Reward | 39 |  | Unverified