SOTAVerified

Starcraft

Starcraft I is a RTS game; the task is to train an agent to play the game.

( Image credit: Macro Action Selection with Deep Reinforcement Learning in StarCraft )

Papers

Showing 51100 of 311 papers

TitleStatusHype
MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning0
COA-GPT: Generative Pre-trained Transformers for Accelerated Course of Action Development in Military Operations0
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language modelsCode1
BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions0
Innate-Values-driven Reinforcement Learning based Cooperative Multi-Agent Cognitive Modeling0
StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments0
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization ApproachCode2
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning0
CODEX: A Cluster-Based Method for Explainable Reinforcement LearningCode0
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play0
JaxMARL: Multi-Agent RL Environments and Algorithms in JAXCode2
QFree: A Universal Value Function Factorization for Multi-Agent Reinforcement Learning0
Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization0
Privacy-Engineered Value Decomposition Networks for Cooperative Multi-Agent Reinforcement Learning0
Fidelity-Induced Interpretable Policy Extraction for Reinforcement Learning0
Leveraging World Model Disentanglement in Value-Based Multi-Agent Reinforcement Learning0
FoX: Formation-aware exploration in multi-agent reinforcement learningCode1
Never Explore Repeatedly in Multi-Agent Reinforcement Learning0
AlphaStar Unplugged: Large-Scale Offline Reinforcement LearningCode2
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value RegularizationCode1
Transferable Curricula through Difficulty Conditioned Generators0
Anticipatory Thinking Challenges in Open Worlds: Risk Management0
Maximum Entropy Heterogeneous-Agent Reinforcement LearningCode2
Semantic HELM: A Human-Readable Memory for Reinforcement LearningCode1
Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization0
A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement LearningCode0
EXPODE: EXploiting POlicy Discrepancy for Efficient Exploration in Multi-agent Reinforcement LearningCode0
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?Code1
Boosting Value Decomposition via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning0
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement LearningCode1
Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information PrinciplesCode1
SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning0
GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement LearningCode0
MAC-PO: Multi-Agent Experience Replay via Collective Priority OptimizationCode0
AIIR-MIX: Multi-Agent Reinforcement Learning Meets Attention Individual Intrinsic Reward Mixing Network0
ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning0
Equivariant MuZero0
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority InfluenceCode1
PushWorld: A benchmark for manipulation planning with tools and movable obstaclesCode1
TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning ProblemsCode1
Self-Motivated Multi-Agent ExplorationCode0
Strangeness-driven Exploration in Multi-Agent Reinforcement LearningCode0
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement LearningCode2
Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning0
System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games0
CURO: Curriculum Learning for Relative Overgeneralization0
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-DependencyCode2
Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning0
Decision-making with Speculative Opponent ModelsCode0
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team CompetitionCode0
Show:102550
← PrevPage 2 of 7Next →

No leaderboard results yet.