Effective Multi-Agent Deep Reinforcement Learning Control with Relative Entropy Regularization Sep 26, 2023 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 15 Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning Oct 8, 2024 GSM8K Multi-agent Reinforcement Learning
Code Code Available 15 Effective control of two-dimensional Rayleigh--Bénard convection: invariant multi-agent reinforcement learning is all you need Apr 5, 2023 All Deep Reinforcement Learning
Code Code Available 15 Energy-based Surprise Minimization for Multi-Agent Value Factorization Sep 16, 2020 Multi-agent Reinforcement Learning Q-Learning
Code Code Available 15 Celebrating Diversity in Shared Multi-Agent Reinforcement Learning Jun 4, 2021 Diversity Multi-agent Reinforcement Learning
Code Code Available 15 CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems Jan 6, 2025 Computational Efficiency Multi-agent Reinforcement Learning
Code Code Available 15 Distributed Resource Allocation with Multi-Agent Deep Reinforcement Learning for 5G-V2V Communication Oct 11, 2020 Deep Reinforcement Learning Distributed Optimization
Code Code Available 15 Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search May 19, 2022 Decision Making Image Captioning
Code Code Available 15 CoLight: Learning Network-level Cooperation for Traffic Signal Control May 11, 2019 Multi-agent Reinforcement Learning Reinforcement Learning
Code Code Available 15 C-COMA: A CONTINUAL REINFORCEMENT LEARNING MODEL FOR DYNAMIC MULTIAGENT ENVIRONMENTS Apr 5, 2021 Continual Learning Multi-agent Reinforcement Learning
Code Code Available 15 ALMA: Hierarchical Learning for Composite Multi-Agent Tasks May 27, 2022 Decision Making Inductive Bias
Code Code Available 15 Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning Jun 7, 2020 counterfactual Multi-agent Reinforcement Learning
Code Code Available 15 Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning Jun 7, 2021 Multi-agent Reinforcement Learning Offline RL
Code Code Available 15 A MARL Based Multi-Target Tracking Algorithm Under Jamming Against Radar Dec 17, 2024 Multi-agent Reinforcement Learning
Code Code Available 15 Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models Jun 9, 2025 Multi-agent Reinforcement Learning Safety Alignment
Code Code Available 15 CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning Jun 19, 2023 Conformal Prediction Decision Making
Code Code Available 15 CityLearn: Standardizing Research in Multi-Agent Reinforcement Learning for Demand Response and Urban Energy Management Dec 18, 2020 energy management Management
Code Code Available 15 Neural Auto-Curricula Jun 4, 2021 Multi-agent Reinforcement Learning
Code Code Available 15 Efficient Multi-agent Reinforcement Learning by Planning May 20, 2024 Computational Efficiency Model-based Reinforcement Learning
Code Code Available 15 Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning Dec 4, 2019 Decoder Multi-agent Reinforcement Learning
Code Code Available 15 Collaborative Visual Navigation Jul 2, 2021 Multi-agent Reinforcement Learning Navigate
Code Code Available 15 Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information Principles Apr 3, 2023 Multi-agent Reinforcement Learning Starcraft
Code Code Available 15 A multi-agent reinforcement learning model of common-pool resource appropriation Jul 20, 2017 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 15 Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration Nov 22, 2021 Efficient Exploration Multi-agent Reinforcement Learning
Code Code Available 15 Enhancing Cooperation through Selective Interaction and Long-term Experiences in Multi-Agent Reinforcement Learning May 4, 2024 Attribute Multi-agent Reinforcement Learning
Code Code Available 15