Residual Flows for Invertible Generative Modeling

2019-06-06 · NeurIPS 2019 · Code Available

Ricky T. Q. Chen, Jens Behrmann, David Duvenaud, Jörn-Henrik Jacobsen

Code

github.com/rtqichen/residual-flows (official, PyTorch, ★ 276)
github.com/thu-ml/implicit-normalizing-flows (PyTorch, ★ 37)
github.com/yperugachidiaz/invertible_densenets (PyTorch, ★ 24)
github.com/eyalbetzalel/residual-flows (PyTorch, ★ 0)

Abstract

Flow-based generative models parameterize probability distributions through an invertible transformation and can be trained by maximum likelihood. Invertible residual networks provide a flexible family of transformations where only Lipschitz conditions rather than strict architectural constraints are needed for enforcing invertibility. However, prior work trained invertible residual networks for density estimation by relying on biased log-density estimates whose bias increased with the network's expressiveness. We give a tractable unbiased estimate of the log density using a "Russian roulette" estimator, and reduce the memory required during training by using an alternative infinite series for the gradient. Furthermore, we improve invertible residual blocks by proposing the use of activation functions that avoid derivative saturation and generalizing the Lipschitz condition to induced mixed norms. The resulting approach, called Residual Flows, achieves state-of-the-art performance on density estimation amongst flow-based models, and outperforms networks that use coupling blocks at joint generative and discriminative modeling.
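The "Russian roulette" idea mentioned in the abstract can be illustrated on a toy problem. The sketch below is not the authors' implementation: it assumes an explicit small matrix J with spectral norm below 1 (standing in for the Jacobian of a residual branch), uses the power series log det(I + J) = sum over k >= 1 of (-1)^(k+1) tr(J^k) / k, truncates that series at a random N drawn from a geometric distribution, and reweights each kept term by 1 / P(N >= k) so the expectation matches the full series. The function name `russian_roulette_logdet`, the choice p = 0.5, and the Gaussian Hutchinson probe for the trace are all assumptions made for illustration.

```python
import numpy as np


def russian_roulette_logdet(J, rng, p=0.5, n_samples=1):
    """Return n_samples unbiased estimates of log det(I + J).

    Assumes the spectral norm of J is below 1, so the power series converges.
    """
    d = J.shape[0]
    estimates = np.empty(n_samples)
    for s in range(n_samples):
        # Random truncation point N ~ Geometric(p) on {1, 2, ...},
        # which gives P(N >= k) = (1 - p) ** (k - 1).
        n = rng.geometric(p)
        v = rng.standard_normal(d)  # Hutchinson probe with E[v v^T] = I
        w = v.copy()
        total = 0.0
        for k in range(1, n + 1):
            w = J @ w                        # w now holds J^k v
            trace_est = v @ w                # single-probe estimate of tr(J^k)
            survival = (1.0 - p) ** (k - 1)  # P(N >= k)
            total += (-1) ** (k + 1) / k * trace_est / survival
        estimates[s] = total
    return estimates


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = rng.standard_normal((4, 4))
    J = 0.3 * A / np.linalg.norm(A, 2)  # rescale to spectral norm 0.3
    estimate = russian_roulette_logdet(J, rng, n_samples=20000).mean()
    exact = np.linalg.slogdet(np.eye(4) + J)[1]
    print("Monte Carlo mean:", estimate, "exact:", exact)
```

Each individual estimate uses only a finite, random number of series terms, yet the average over many draws approaches the exact value. The reweighting inflates later terms, so p and the norm of J need to be chosen such that the reweighted terms still decay; otherwise the variance can become large.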

Tasks

Density Estimation, Image Generation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CelebA 256x256	Residual Flow	bits/dim	0.99	-	Unverified
CIFAR-10	Residual Flow	FID	46.37	-	Unverified
ImageNet 32x32	Residual Flow	bits/dim	4.01	-	Unverified
ImageNet 64x64	Residual Flow	bits/dim	3.76	-	Unverified
MNIST	Residual Flow	bits/dim	0.97	-	Unverified
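
For readers unfamiliar with the unit, the bits-per-dimension figures above are a negative log-likelihood divided by the number of data dimensions and converted from nats to bits. The sketch below shows only that unit conversion; the helper name `bits_per_dim` and the input values are hypothetical, and any dataset-specific preprocessing (such as dequantization offsets) that a benchmark pipeline may apply is not modeled here.

```python
import math


def bits_per_dim(nll_nats, num_dims):
    """Convert a per-example negative log-likelihood in nats to bits per dimension."""
    return nll_nats / (num_dims * math.log(2))


if __name__ == "__main__":
    # Hypothetical example: a 32x32 RGB image has 3 * 32 * 32 = 3072 dimensions.
    num_dims = 3 * 32 * 32
    nll_nats = 7000.0  # made-up value, not taken from the paper
    print(bits_per_dim(nll_nats, num_dims))
```

Lower values indicate that the model assigns higher likelihood to the data.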
