Heterogeneous Stochastic Interactions for Multiple Agents in a Multi-armed Bandit Problem

2019-05-21

Udari Madhushani, Naomi Ehrich Leonard

Abstract

We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors. Neighbors are defined by a network graph with heterogeneous and stochastic interconnections. These interactions are determined by the sociability of each agent, which corresponds to the probability that the agent observes its neighbors. We design an algorithm for each agent to maximize its own expected cumulative reward and prove performance bounds that depend on the sociability of the agents and the network structure. We use the bounds to predict the rank ordering of agents according to their performance and verify the accuracy analytically and computationally.
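The observation model described in the abstract can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the authors' algorithm: the abstract does not specify the decision rule, so a standard UCB1 index is swapped in as a stand-in, rewards are assumed Gaussian with unit variance, and each agent is assumed to make a single Bernoulli draw per round (with success probability equal to its sociability) that decides whether it observes all of its neighbors in that round. The names `simulate`, `adjacency`, `sociability`, and `arm_means` are hypothetical.

```python
import math
import random


def simulate(adjacency, sociability, arm_means, horizon, seed=0):
    """Multi-agent bandit with stochastic neighbor observation (illustrative).

    adjacency[k]   : list of neighbors of agent k in the network graph
    sociability[k] : probability that agent k observes its neighbors in a round
    Returns the cumulative expected regret of each agent.
    """
    rng = random.Random(seed)
    n_agents, n_arms = len(adjacency), len(arm_means)
    # Per-agent statistics, built from the agent's own and observed samples.
    counts = [[0] * n_arms for _ in range(n_agents)]
    sums = [[0.0] * n_arms for _ in range(n_agents)]
    regret = [0.0] * n_agents
    best_mean = max(arm_means)

    for t in range(1, horizon + 1):
        choices, rewards = [], []
        for k in range(n_agents):
            untried = [a for a in range(n_arms) if counts[k][a] == 0]
            if untried:
                arm = untried[0]  # sample every arm at least once
            else:
                # UCB1 index (assumed stand-in for the paper's rule).
                arm = max(
                    range(n_arms),
                    key=lambda a: sums[k][a] / counts[k][a]
                    + math.sqrt(2.0 * math.log(t) / counts[k][a]),
                )
            choices.append(arm)
            rewards.append(rng.gauss(arm_means[arm], 1.0))  # assumed noise model
            regret[k] += best_mean - arm_means[arm]

        for k in range(n_agents):
            # Each agent always records its own choice and reward.
            counts[k][choices[k]] += 1
            sums[k][choices[k]] += rewards[k]
            # With probability sociability[k], it also records its neighbors'.
            if rng.random() < sociability[k]:
                for j in adjacency[k]:
                    counts[k][choices[j]] += 1
                    sums[k][choices[j]] += rewards[j]
    return regret
```

Calling, for example, `simulate([[1, 2], [0], [0]], [0.9, 0.1, 0.1], [0.2, 0.5, 0.8], 200)` runs a three-agent star graph in which the center agent is highly sociable. Under this sketch, comparing the returned regrets across agents is one way to explore how sociability and network position might affect performance; it does not reproduce the paper's bounds.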

Tasks

Decision Making
