SOTAVerified

Starcraft II

Starcraft II is a RTS game; the task is to train an agent to play the game.

( Image credit: The StarCraft Multi-Agent Challenge )

Papers

Showing 150 of 175 papers

TitleStatusHype
Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First TimeCode2
LLM-PySC2: Starcraft II learning environment for Large Language ModelsCode2
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement LearningCode2
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization ApproachCode2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAXCode2
AlphaStar Unplugged: Large-Scale Offline Reinforcement LearningCode2
On Efficient Reinforcement Learning for Full-length Game of StarCraft IICode2
AVA: Attentive VLM Agent for Mastering StarCraft IICode1
Trajectory-Class-Aware Multi-Agent Reinforcement LearningCode1
Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy OptimizationCode1
Group-Aware Coordination Graph for Multi-Agent Reinforcement LearningCode1
N-Agent Ad Hoc TeamworkCode1
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language modelsCode1
FoX: Formation-aware exploration in multi-agent reinforcement learningCode1
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value RegularizationCode1
Semantic HELM: A Human-Readable Memory for Reinforcement LearningCode1
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?Code1
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement LearningCode1
Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information PrinciplesCode1
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority InfluenceCode1
TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning ProblemsCode1
Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraftCode1
SC2EGSet: StarCraft II Esport Replay and Game-state DatasetCode1
MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay BufferCode1
CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement LearningCode1
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC TasksCode1
Regularized Softmax Deep Multi-Agent Q-LearningCode1
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven ExplorationCode1
Coordinated Proximal Policy OptimizationCode1
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent DemonstrationsCode1
Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft IICode1
Rethinking of AlphaStarCode1
Perceiver IO: A General Architecture for Structured Inputs & OutputsCode1
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement LearningCode1
Context-Aware Sparse Deep Coordination GraphsCode1
Celebrating Diversity in Shared Multi-Agent Reinforcement LearningCode1
Cooperative Multi-Agent Reinforcement Learning with Sequential Credit AssignmentCode1
Gym-μRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement LearningCode1
An Introduction of mini-AlphaStarCode1
Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement LearningCode1
C-COMA: A CONTINUAL REINFORCEMENT LEARNING MODEL FOR DYNAMIC MULTIAGENT ENVIRONMENTSCode1
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent GamesCode1
Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement LearningCode1
TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full GameCode1
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement LearningCode1
Graph Convolutional Value Decomposition in Multi-Agent Reinforcement LearningCode1
RODE: Learning Roles to Decompose Multi-Agent TasksCode1
Energy-based Surprise Minimization for Multi-Agent Value FactorizationCode1
QPLEX: Duplex Dueling Multi-Agent Q-LearningCode1
Off-Policy Multi-Agent Decomposed Policy GradientsCode1
Show:102550
← PrevPage 1 of 4Next →

No leaderboard results yet.