Improved Conditional VRNNs for Video Prediction

2019-04-27 · ICCV 2019 · Code Available

Lluis Castrejon, Nicolas Ballas, Aaron Courville

Code

github.com/facebookresearch/improved_vrnn
PyTorch

Abstract

Predicting future frames for a video sequence is a challenging generative modeling task. Promising approaches include probabilistic latent variable models such as the Variational Auto-Encoder. While VAEs can handle uncertainty and model multiple possible future outcomes, they have a tendency to produce blurry predictions. In this work we argue that this is a sign of underfitting. To address this issue, we propose to increase the expressiveness of the latent distributions and to use higher capacity likelihood models. Our approach relies on a hierarchy of latent variables, which defines a family of flexible prior and posterior distributions in order to better model the probability of future sequences. We validate our proposal through a series of ablation experiments and compare our approach to current state-of-the-art latent variable models. Our method performs favorably under several metrics in three different datasets.

Tasks

Prediction Video Generation Video Prediction

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
BAIR Robot Pushing	Hier-VRNN	FVD score	143.4	—	Unverified
BAIR Robot Pushing	VRNN 1L	FVD score	149.22	—	Unverified
