SOTAVerified

SMAC

Bechmarks for Efficient Exploration of Completion of Multi-stage Tasks and Usage of Environmental Factors

Papers

Showing 51100 of 121 papers

TitleStatusHype
Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World ModelsCode0
Higher Replay Ratio Empowers Sample-Efficient Multi-Agent Reinforcement Learning0
Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration0
Better Understandings and Configurations in MaxSAT Local Search Solvers via Anytime Performance Analysis0
PPS-QMIX: Periodically Parameter Sharing for Accelerating Convergence of Multi-Agent Reinforcement LearningCode0
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning0
Aligning Individual and Collective Objectives in Multi-Agent Cooperation0
Enabling Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation0
MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning0
Poisson Process for Bayesian Optimization0
T2MAC: Targeted and Trusted Multi-Agent Communication through Selective Engagement and Evidence-Driven Integration0
AgentMixer: Multi-Agent Correlated Policy Factorization0
Innate-Values-driven Reinforcement Learning based Cooperative Multi-Agent Cognitive Modeling0
How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning0
QFree: A Universal Value Function Factorization for Multi-Agent Reinforcement Learning0
MaskMA: Towards Zero-Shot Multi-Agent Decision Making with Mask-Based Collaborative Learning0
Privacy-Engineered Value Decomposition Networks for Cooperative Multi-Agent Reinforcement Learning0
A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement LearningCode0
Research on Multi-Agent Communication and Collaborative Decision-Making Based on Deep Reinforcement Learning0
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning0
Automated classification of pre-defined movement patterns: A comparison between GNSS and UWB technology0
GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement LearningCode0
Self-Motivated Multi-Agent ExplorationCode0
Effects of Spectral Normalization in Multi-agent Reinforcement LearningCode0
Decision-making with Speculative Opponent ModelsCode0
Contextual Transformer for Offline Meta Reinforcement Learning0
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning0
Maximum Correntropy Value Decomposition for Multi-agent Deep Reinforcemen Learning0
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL0
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning0
Towards Comprehensive Testing on the Robustness of Cooperative Multi-agent Reinforcement Learning0
Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework0
Depthwise Convolution for Multi-Agent Communication with Enhanced Mean-Field Approximation0
Exploiting Semantic Epsilon Greedy Exploration Strategy in Multi-Agent Reinforcement Learning0
A Comparative study of Hyper-Parameter Optimization Tools0
Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning0
Cooperative Multi-Agent Reinforcement Learning with Hypergraph ConvolutionCode0
State-based Episodic Memory for Multi-Agent Reinforcement Learning0
Role Diversity Matters: A Study of Cooperative Training Strategies for Multi-Agent RL0
Revisiting the Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning0
MARNET: Backdoor Attacks against Value-Decomposition Multi-Agent Reinforcement Learning0
SMAC-Seg: LiDAR Panoptic Segmentation via Sparse Multi-directional Attention Clustering0
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning0
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning0
Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients0
NQMIX: Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning0
Robust Multi-Agent Reinforcement Learning Driven by Correlated Equilibrium0
Coordinated Multi-Agent Exploration Using Shared Goals0
UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers0
QVMix and QVMix-Max: Extending the Deep Quality-Value Family of Algorithms to Cooperative Multi-Agent Reinforcement LearningCode0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACEMedian Win Rate100Unverified
2DDNMedian Win Rate97.22Unverified
3DPLEXMedian Win Rate96.88Unverified
4QPLEXMedian Win Rate96.88Unverified
5DMIXMedian Win Rate95.11Unverified
6QMIXMedian Win Rate92.44Unverified
7VDNMedian Win Rate89.2Unverified
8DIQLMedian Win Rate85.23Unverified
9QMIXMedian Win Rate69Unverified
10QMIXMedian Win Rate69Unverified
#ModelMetricClaimedVerifiedStatus
1ACEMedian Win Rate100Unverified
2DDNMedian Win Rate94.03Unverified
3DMIXMedian Win Rate91.08Unverified
4DPLEXMedian Win Rate90.62Unverified
5VDNMedian Win Rate89.2Unverified
6QPLEXMedian Win Rate84.38Unverified
7QMIXMedian Win Rate67.22Unverified
8DIQLMedian Win Rate62.22Unverified
9IQLMedian Win Rate29.83Unverified
10QMIXMedian Win Rate2Unverified
#ModelMetricClaimedVerifiedStatus
1ACEMedian Win Rate100Unverified
2DDNMedian Win Rate95.4Unverified
3DIQLMedian Win Rate91.62Unverified
4DMIXMedian Win Rate90.45Unverified
5VDNMedian Win Rate85.34Unverified
6IQLMedian Win Rate84.87Unverified
7DPLEXMedian Win Rate81.25Unverified
8QPLEXMedian Win Rate75Unverified
9QMIXMedian Win Rate37.61Unverified
10QMIXMedian Win Rate1Unverified
#ModelMetricClaimedVerifiedStatus
1ACEMedian Win Rate93.75Unverified
2DDNMedian Win Rate83.92Unverified
3DMIXMedian Win Rate49.43Unverified
4DPLEXMedian Win Rate43.75Unverified
5QPLEXAverage Score15.95Unverified
6QMIXMedian Win Rate12.78Unverified
7QMIXMedian Win Rate3Unverified
8QMIXMedian Win Rate3Unverified
9HeuristicMedian Win Rate0Unverified
10VDNMedian Win Rate0Unverified
#ModelMetricClaimedVerifiedStatus
1DDNMedian Win Rate91.48Unverified
2DPLEXMedian Win Rate90.62Unverified
3DMIXMedian Win Rate85.45Unverified
4QMIXMedian Win Rate84.77Unverified
5QPLEXMedian Win Rate78.12Unverified
6VDNMedian Win Rate63.12Unverified
7QMIXMedian Win Rate49Unverified
8QMIXMedian Win Rate49Unverified
9DIQLMedian Win Rate6.02Unverified
10IQLMedian Win Rate2.27Unverified
#ModelMetricClaimedVerifiedStatus
1DMIXAverage Score19.17Unverified
2QPLEXAverage Score18.66Unverified
3DDNAverage Score18.49Unverified
4DPLEXAverage Score18.49Unverified
5QMIXAverage Score18.23Unverified
6VDNAverage Score16.69Unverified
#ModelMetricClaimedVerifiedStatus
1DDNAverage Score19.65Unverified
2DMIXAverage Score18.61Unverified
3VDNAverage Score17.16Unverified
4DPLEXAverage Score14.99Unverified
5QPLEXAverage Score13.6Unverified
6QMIXAverage Score13.09Unverified
#ModelMetricClaimedVerifiedStatus
1DDNAverage Score16Unverified
2DPLEXAverage Score14.84Unverified
3QPLEXAverage Score13.86Unverified
4DMIXAverage Score13.73Unverified
5VDNAverage Score13.57Unverified
6QMIXAverage Score12.37Unverified
#ModelMetricClaimedVerifiedStatus
1DDNAverage Score11.1Unverified
2DPLEXAverage Score10.71Unverified
3VDNAverage Score7.78Unverified
4DMIXAverage Score7.41Unverified
5QPLEXAverage Score6.44Unverified
6QMIXAverage Score4.8Unverified
#ModelMetricClaimedVerifiedStatus
1DDNAverage Score16.5Unverified
2DMIXAverage Score16.24Unverified
3DPLEXAverage Score15.89Unverified
4QPLEXAverage Score15.52Unverified
5QMIXAverage Score14.4Unverified
6VDNAverage Score13.13Unverified
#ModelMetricClaimedVerifiedStatus
1DDNAverage Score19.45Unverified
2DPLEXAverage Score19.4Unverified
3DMIXAverage Score19.33Unverified
4QPLEXAverage Score19.06Unverified
5QMIXAverage Score19.01Unverified
6VDNAverage Score17.3Unverified