Video Occupancy Models

2024-06-25Code Available1· sign in to hype

Manan Tomar, Philippe Hansen-Estruch, Philip Bachman, Alex Lamb, John Langford, Matthew E. Taylor, Sergey Levine

Code Available — Be the first to reproduce this paper.

Code

github.com/manantomar/video-occupancy-models
OfficialIn paperpytorch★ 12

Abstract

We introduce a new family of video prediction models designed to support downstream control tasks. We call these models Video Occupancy models (VOCs). VOCs operate in a compact latent space, thus avoiding the need to make predictions about individual pixels. Unlike prior latent-space world models, VOCs directly predict the discounted distribution of future states in a single step, thus avoiding the need for multistep roll-outs. We show that both properties are beneficial when building predictive models of video for use in downstream control. Code is available at github.com/manantomar/video-occupancy-models.

Tasks

Video Prediction

Video Occupancy Models

Code

Abstract

Tasks

Reproductions