Cooperative Actor-Critic via TD Error Aggregation Jul 25, 2022 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Few-Shot Teamwork Jul 19, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework Jul 12, 2022 Multi-agent Reinforcement Learning Policy Gradient Methods
— Unverified 0Reward-Sharing Relational Networks in Multi-Agent Reinforcement Learning as a Framework for Emergent Behavior Jul 12, 2022 Multi-agent Reinforcement Learning Sociology
— Unverified 0Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning Jul 8, 2022 Diversity Multi-agent Reinforcement Learning
Code Code Available 1High Performance Simulation for Scalable Multi-Agent Reinforcement Learning Jul 8, 2022 GPU Multi-agent Reinforcement Learning
— Unverified 0VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning Jul 7, 2022 Benchmarking Multi-agent Reinforcement Learning
Code Code Available 2Decentralized scheduling through an adaptive, trading-based multi-agent system Jul 5, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning Jul 5, 2022 Decoder Multi-agent Reinforcement Learning
Code Code Available 1The StarCraft Multi-Agent Challenges+ : Learning of Multi-Stage Tasks and Environmental Factors without Precise Reward Functions Jul 5, 2022 Multi-agent Reinforcement Learning SMAC+
Code Code Available 1DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems Jun 28, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0EMVLight: a Multi-agent Reinforcement Learning Framework for an Emergency Vehicle Decentralized Routing and Traffic Signal Control System Jun 27, 2022 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Functional Optimization Reinforcement Learning for Real-Time Bidding Jun 25, 2022 Attribute Multi-agent Reinforcement Learning
— Unverified 0Toward multi-target self-organizing pursuit in a partially observable Markov game Jun 24, 2022 Decision Making Deep Reinforcement Learning
Code Code Available 1PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning Jun 22, 2022 counterfactual Multi-agent Reinforcement Learning
Code Code Available 0Certifiably Robust Policy Learning against Adversarial Communication in Multi-agent Systems Jun 21, 2022 Multi-agent Reinforcement Learning
— Unverified 0MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer Jun 20, 2022 Multi-agent Reinforcement Learning Q-Learning
Code Code Available 1S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning? Jun 20, 2022 All Multi-agent Reinforcement Learning
— Unverified 0From Multi-agent to Multi-robot: A Scalable Training and Evaluation Platform for Multi-robot Reinforcement Learning Jun 20, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Logic-based Reward Shaping for Multi-Agent Reinforcement Learning Jun 17, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning Jun 15, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Universally Expressive Communication in Multi-Agent Reinforcement Learning Jun 14, 2022 Graph Learning Multi-agent Reinforcement Learning
Code Code Available 0Multi-Agent Neural Rewriter for Vehicle Routing with Limited Disclosure of Costs Jun 13, 2022 Multi-agent Reinforcement Learning
— Unverified 0Finite-Time Analysis of Fully Decentralized Single-Timescale Actor-Critic Jun 12, 2022 Multi-agent Reinforcement Learning Privacy Preserving
— Unverified 0Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based on Maximum Entropy Jun 10, 2022 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0