FineGAN: Unsupervised Hierarchical Disentanglement for Fine-Grained Object Generation and Discovery
Krishna Kumar Singh, Utkarsh Ojha, Yong Jae Lee
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/kkanshul/fineganOfficialIn paperpytorch★ 0
Abstract
We propose FineGAN, a novel unsupervised GAN framework, which disentangles the background, object shape, and object appearance to hierarchically generate images of fine-grained object categories. To disentangle the factors without supervision, our key idea is to use information theory to associate each factor to a latent code, and to condition the relationships between the codes in a specific way to induce the desired hierarchy. Through extensive experiments, we show that FineGAN achieves the desired disentanglement to generate realistic and diverse images belonging to fine-grained classes of birds, dogs, and cars. Using FineGAN's automatically learned features, we also cluster real images as a first attempt at solving the novel problem of unsupervised fine-grained object category discovery. Our code/models/demo can be found at https://github.com/kkanshul/finegan
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| CUB Birds | FineGAN | Accuracy | 0.13 | — | Unverified |
| Stanford Cars | FineGAN | Accuracy | 0.08 | — | Unverified |
| Stanford Dogs | FineGAN | Accuracy | 0.08 | — | Unverified |