Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

2023-01-26Unverified0· sign in to hype

Nikolai Karpov, Qin Zhang

Unverified — Be the first to reproduce this paper.

Abstract

In this paper, we study the collaborative learning model, which concerns the tradeoff between parallelism and communication overhead in multi-agent multi-armed bandits. For regret minimization in multi-armed bandits, we present the first set of tradeoffs between the number of rounds of communication among the agents and the regret of the collaborative learning process.

Tasks

Multi-agent Reinforcement Learning Multi-Armed Bandits reinforcement-learning Reinforcement Learning (RL)

Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

Abstract

Tasks

Reproductions