SOTAVerified

Multi-agent Reinforcement Learning

The goal of multi-agent reinforcement learning is to solve complex problems by combining multiple agents, each focused on a different sub-task. Multi-agent systems generally fall into two types: independent and cooperative.

Source: Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports

Papers

Showing 1651-1700 of 1718 papers

Title | Status | Hype
Multi-agent Deep Reinforcement Learning with Extremely Noisy Observations | | 0
Deep Multi-Agent Reinforcement Learning with Relevance Graphs | Code | 0
Emergence of linguistic conventions in multi-agent reinforcement learning | | 0
Coordinating Disaster Emergency Response with Heuristic Reinforcement Learning | | 0
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning | Code | 1
Multi-Agent Common Knowledge Reinforcement Learning | Code | 0
TarMAC: Targeted Multi-Agent Communication | | 0
Multi-Agent Reinforcement Learning Based Resource Allocation for UAV Networks | | 0
Multi-Agent Actor-Critic with Generative Cooperative Policy Network | | 0
Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning | Code | 1
Actor-Attention-Critic for Multi-Agent Reinforcement Learning | Code | 1
M^3RL: Mind-aware Multi-agent Management Reinforcement Learning | Code | 0
A Better Baseline for Second Order Gradient Estimation in Stochastic Computation Graphs | | 0
Learning through Probing: a decentralized reinforcement learning architecture for social dilemmas | | 0
IntelligentCrowd: Mobile Crowdsensing via Multi-Agent Reinforcement Learning | | 0
Prosocial or Selfish? Agents with different behaviors for Contract Negotiation using Reinforcement Learning | | 0
Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning | | 0
Negative Update Intervals in Deep Multi-Agent Reinforcement Learning | Code | 1
Coordination-driven learning in multi-agent problem spaces | | 0
CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning | Code | 0
A Multi-Agent Reinforcement Learning Method for Impression Allocation in Online Display Advertising | | 0
MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures | | 0
Learning to Share and Hide Intentions using Information Regularization | Code | 0
Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches | Code | 0
Learning to Act in Decentralized Partially Observable MDPs | | 0
Learning Existing Social Conventions via Observationally Augmented Self-Play | | 0
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization | | 0
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients | | 0
On Gradient-Based Learning in Continuous Games | | 0
Emergent Communication through Negotiation | | 0
Towards Learning Transferable Conversational Skills using Multi-dimensional Dialogue Modelling | Code | 0
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Code | 1
Entropy based Independent Learning in Anonymous Multi-Agent Settings | | 0
Inequity aversion improves cooperation in intertemporal social dilemmas | Code | 1
Valuing knowledge, information and agency in Multi-agent Reinforcement Learning: a case study in smart buildings | | 0
Intent-aware Multi-agent Reinforcement Learning | | 0
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising | | 0
Modeling Others using Oneself in Multi-Agent Reinforcement Learning | | 0
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents | Code | 1
Asynchronous stochastic approximations with asymptotically biased errors and deep multi-agent learning | | 0
Learning to Gather without Communication | Code | 0
Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Code | 0
Mean Field Multi-Agent Reinforcement Learning | Code | 1
Cooperative Multi-Agent Reinforcement Learning for Low-Level Wireless Communication | | 0
Neuron as an Agent | | 0
MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence | Code | 0
Routing Networks: Adaptive Selection of Non-linear Functions for Multi-Task Learning | | 0
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning | | 0
Learning with Opponent-Learning Awareness | Code | 0
Prosocial learning agents solve generalized Stag Hunts better than selfish ones | Code | 0
Page 34 of 35

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MATD3 | final agent reward | -14 | | Unverified
1 | DRIMA | Median Win Rate | 15 | | Unverified
1 | Fusion-Multi-Actor-Attention-Critic | Average Reward | 39 | | Unverified