SOTAVerified

Starcraft

Starcraft I is a RTS game; the task is to train an agent to play the game.

( Image credit: Macro Action Selection with Deep Reinforcement Learning in StarCraft )

Papers

Showing 201250 of 311 papers

TitleStatusHype
DOP: Off-Policy Multi-Agent Decomposed Policy Gradients0
Coordinated Multi-Agent Exploration Using Shared Goals0
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms0
FSV: Learning to Factorize Soft Value Function for Cooperative Multi-Agent Reinforcement Learning0
Factored Action Spaces in Deep Reinforcement Learning0
SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II0
QVMix and QVMix-Max: Extending the Deep Quality-Value Family of Algorithms to Cooperative Multi-Agent Reinforcement LearningCode0
Exact Reduction of Huge Action Spaces in General Reinforcement Learning0
Reinforcement Learning for the Beginning of Starcraft II Game0
Multi-agent Policy Optimization with Approximatively Synchronous Advantage Estimation0
TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full GameCode1
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement LearningCode1
Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?Code1
Multi-Agent Collaboration via Reward Attribution DecompositionCode1
Graph Convolutional Value Decomposition in Multi-Agent Reinforcement LearningCode1
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning0
RODE: Learning Roles to Decompose Multi-Agent TasksCode1
Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning0
AI and Wargaming0
Energy-based Surprise Minimization for Multi-Agent Value FactorizationCode1
QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning0
BGC: Multi-Agent Group Belief with Graph Clustering0
Hierarchical Reinforcement Learning in StarCraft II with Human Expertise in Subgoals Selection0
QPLEX: Duplex Dueling Multi-Agent Q-LearningCode1
Improving Multi-Agent Cooperation using Theory of Mind0
Off-Policy Multi-Agent Decomposed Policy GradientsCode1
Value-Decomposition Multi-Agent Actor-CriticsCode1
Artificial Intelligence is stupid and causal reasoning won't fix it0
QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning0
Deep Implicit Coordination Graphs for Multi-agent Reinforcement LearningCode1
Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement LearningCode1
Lifelong Learning using Eigentasks: Task Separation, Skill Acquisition and Selective Transfer0
StarCraft II Build Order Optimization using Deep Reinforcement Learning and Monte-Carlo Tree Search0
Learning to Play No-Press Diplomacy with Best Response Policy IterationCode1
Incorporating Pragmatic Reasoning Communication into Emergent Language0
Randomized Entity-wise Factorization for Multi-Agent Reinforcement LearningCode1
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization0
The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems0
Optimal Any-Angle Pathfinding on a Sphere0
Real World Games Look Like Spinning TopsCode1
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning0
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement LearningCode2
FACMAC: Factored Multi-Agent Centralised Policy GradientsCode1
DefogGAN: Predicting Hidden Information in the StarCraft Fog of War with Generative Adversarial NetsCode1
From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI0
A Limited-Capacity Minimax Theorem for Non-Convex Games or: How I Learned to Stop Worrying about Mixed-Nash and Love Neural Nets0
Heterogeneous Learning from Demonstration0
Reinforcement Learning-based Application Autoscaling in the Cloud: A Survey0
Meta Reinforcement Learning with Autonomous Inference of Subtask DependenciesCode1
LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement LearningCode1
Show:102550
← PrevPage 5 of 7Next →

No leaderboard results yet.