Vit-GAN: Image-to-image Translation with Vision Transformes and Conditional GANS

2021-10-11Code Available1· sign in to hype

Yiğit Gündüç

Code Available — Be the first to reproduce this paper.

Code

github.com/yigitgunduc/vit-gan
OfficialIn papertf★ 18

Abstract

In this paper, we have developed a general-purpose architecture, Vit-Gan, capable of performing most of the image-to-image translation tasks from semantic image segmentation to single image depth perception. This paper is a follow-up paper, an extension of generator-based model [1] in which the obtained results were very promising. This opened the possibility of further improvements with adversarial architecture. We used a unique vision transformers-based generator architecture and Conditional GANs(cGANs) with a Markovian Discriminator (PatchGAN) (https://github.com/YigitGunduc/vit-gan). In the present work, we use images as conditioning arguments. It is observed that the obtained results are more realistic than the commonly used architectures.

Tasks

Image Segmentation Image-to-Image Translation Semantic Segmentation Translation

Vit-GAN: Image-to-image Translation with Vision Transformes and Conditional GANS

Code

Abstract

Tasks

Reproductions