Anytime-Constrained Equilibria in Polynomial Time

2024-10-31

Jeremy McMahan

Abstract

We extend anytime constraints to the Markov game setting and the corresponding solution concept of an anytime-constrained equilibrium (ACE). Then, we present a comprehensive theory of anytime-constrained equilibria that includes (1) a computational characterization of feasible policies, (2) a fixed-parameter tractable algorithm for computing ACE, and (3) a polynomial-time algorithm for approximately computing ACE. Since computing a feasible policy is NP-hard even for two-player zero-sum games, our approximation guarantees are optimal so long as P ≠ NP. We also develop the first theory of efficient computation for action-constrained Markov games, which may be of independent interest.

Tasks

Multi-agent Reinforcement Learning, Reinforcement Learning
