Bellman Diffusion Models

2024-07-16Unverified0· sign in to hype

Liam Schramm, Abdeslam Boularias

Unverified — Be the first to reproduce this paper.

Abstract

Diffusion models have seen tremendous success as generative architectures. Recently, they have been shown to be effective at modelling policies for offline reinforcement learning and imitation learning. We explore using diffusion as a model class for the successor state measure (SSM) of a policy. We find that enforcing the Bellman flow constraints leads to a simple Bellman update on the diffusion step distribution.

Tasks

Imitation Learning reinforcement-learning Reinforcement Learning

Bellman Diffusion Models

Abstract

Tasks

Reproductions