SOTAVerified

Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation

2021-06-10 · Code Available

Dongjun Kim, Seungjae Shin, Kyungwoo Song, Wanmo Kang, Il-Chul Moon

Abstract

Recent advances in diffusion models have brought state-of-the-art performance on image generation tasks. However, empirical results from previous diffusion-model research imply an inverse correlation between density estimation and sample generation performance. This paper provides empirical evidence that this inverse correlation arises because density estimation is dominated by contributions from small diffusion times, whereas sample generation depends mainly on large diffusion times. Training a score network well across the entire range of diffusion times is demanding, however, because the loss scale is severely imbalanced across diffusion times. For successful training, we therefore introduce Soft Truncation, a universally applicable training technique for diffusion models that softens the fixed, static truncation hyperparameter into a random variable. In experiments, Soft Truncation achieves state-of-the-art performance on the CIFAR-10, CelebA, CelebA-HQ 256x256, and STL-10 datasets.
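The core idea in the abstract, replacing the fixed truncation hyperparameter with a random variable resampled every mini-batch, can be sketched as follows. This is a minimal illustration, not the paper's implementation: the terminal time `T`, the lower bound `EPS_MIN`, and the choice of a 1/tau-shaped prior for the truncation are assumptions for the sake of the example.

```python
import random

T = 1.0          # terminal diffusion time (assumed for illustration)
EPS_MIN = 1e-5   # smallest allowed truncation (assumed hyperparameter)

def sample_soft_truncation():
    """Soft Truncation: instead of a fixed truncation epsilon, draw a
    random truncation tau each mini-batch. Here we assume a prior with
    density p(tau) proportional to 1/tau on [EPS_MIN, T], sampled by
    inverse-CDF: tau = EPS_MIN * (T / EPS_MIN) ** u with u ~ U[0, 1)."""
    u = random.random()
    return EPS_MIN * (T / EPS_MIN) ** u

def sample_diffusion_times(batch_size, tau):
    """Diffusion times for this mini-batch, drawn uniformly on [tau, T];
    the score-matching loss would then only cover times above tau."""
    return [tau + (T - tau) * random.random() for _ in range(batch_size)]

# One training step's time sampling under Soft Truncation:
tau = sample_soft_truncation()
times = sample_diffusion_times(64, tau)
```

Because tau varies across mini-batches, small diffusion times are sometimes excluded and sometimes included, so the network still sees the full time range over training without the loss being dominated by the smallest times at every step.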

Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| CelebA 64x64 | DDPM++ (VP, NLL) + ST | FID | 2.9 | — | Unverified |
| CelebA 64x64 | UNCSN++ (RVE) + ST | bits/dimension | 1.97 | — | Unverified |
| CelebA 64x64 | DDPM++ (VP, FID) + ST | FID | 1.9 | — | Unverified |
| CelebA-HQ 256x256 | UNCSN++ (RVE) + ST | FID | 7.16 | — | Unverified |
| FFHQ 256x256 | UDM (RVE) + ST | FID | 5.54 | — | Unverified |
| ImageNet 32x32 | DDPM++ (VP, NLL) + ST | FID | 8.42 | — | Unverified |
| LSUN Bedroom 256x256 | UDM (RVE) + ST | FID | 4.57 | — | Unverified |
| STL-10 | UNCSN++ (RVE) + ST | FID | 7.71 | — | Unverified |

Reproductions