Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games Feb 13, 2025 Multi-agent Reinforcement Learning Uncertainty Quantification
Improving the generalizability and robustness of large-scale traffic signal control Jun 2, 2023 Deep Reinforcement Learning Distributional Reinforcement Learning
Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning Jun 11, 2019 Decision Making Deep Reinforcement Learning
B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning Jan 30, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
Improving International Climate Policy via Mutually Conditional Binding Commitments Jul 26, 2023 Decision Making Multi-agent Reinforcement Learning
Group-Agent Reinforcement Learning Feb 10, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Independent Natural Policy Gradient Always Converges in Markov Potential Games Oct 20, 2021 Multi-agent Reinforcement Learning
Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space Aug 14, 2024 Multi-agent Reinforcement Learning SMAC
Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players Aug 15, 2024 Multi-agent Reinforcement Learning
Improved cooperation by balancing exploration and exploitation in intertemporal social dilemma tasks Oct 19, 2021 Attribute Diversity
DCMAC: Demand-aware Customized Multi-Agent Communication via Upper Bound Training Sep 11, 2024 Multi-agent Reinforcement Learning
Impression Allocation and Policy Search in Display Advertising Mar 11, 2022 Multi-agent Reinforcement Learning
Inducing Cooperation via Learning to reshape rewards in semi-cooperative multi-agent reinforcement learning May 1, 2019 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Implementations that Matter in Cooperative Multi-Agent Reinforcement Learning Jan 17, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Inductive Bias for Emergent Communication in a Continuous Setting Jun 6, 2023 Inductive Bias Multi-agent Reinforcement Learning
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning Dec 10, 2023 Multi-agent Reinforcement Learning reinforcement-learning
A Variational Approach to Mutual Information-Based Coordination for Multi-Agent Reinforcement Learning Mar 1, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Influence-Based Reinforcement Learning for Intrinsically-Motivated Agents Aug 28, 2021 counterfactual Multi-agent Reinforcement Learning
Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement learning Sep 29, 2021 Deep Reinforcement Learning Multi-agent Reinforcement Learning
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem May 26, 2023 MuJoCo Multi-agent Reinforcement Learning
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning Feb 28, 2024 Action Generation Multi-agent Reinforcement Learning
Information Structure in Mappings: An Approach to Learning, Representation, and Generalisation May 29, 2025 Multi-agent Reinforcement Learning
Metric Policy Representations for Opponent Modeling Jun 10, 2021 Multi-agent Reinforcement Learning
Integrating independent and centralized multi-agent reinforcement learning for traffic signal network optimization Sep 23, 2019 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Data-Driven Distributed Common Operational Picture from Heterogeneous Platforms using Multi-Agent Reinforcement Learning Nov 8, 2024 Deep Reinforcement Learning Multi-agent Reinforcement Learning