| Hierarchical Text-Conditional Image Generation with CLIP Latents | Apr 13, 2022 | Conditional Image GenerationDecoder | CodeCode Available | 3 |
| Zero-Shot Text-to-Image Generation | Feb 24, 2021 | Image GenerationText to Image Generation | CodeCode Available | 3 |
| DreamLLM: Synergistic Multimodal Comprehension and Creation | Sep 20, 2023 | multimodal generationVisual Question Answering | CodeCode Available | 2 |
| Blended Latent Diffusion | Jun 6, 2022 | Image GenerationImage Inpainting | CodeCode Available | 2 |
| GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models | Dec 20, 2021 | DiversityImage Generation | CodeCode Available | 2 |
| CogView: Mastering Text-to-Image Generation via Transformers | May 26, 2021 | Image GenerationSuper-Resolution | CodeCode Available | 2 |
| Shifted Diffusion for Text-to-image Generation | Nov 24, 2022 | Image GenerationText to Image Generation | CodeCode Available | 1 |
| FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization | Dec 2, 2021 | counterfactualImage Generation | CodeCode Available | 1 |
| Blended Diffusion for Text-driven Editing of Natural Images | Nov 29, 2021 | text-guided-image-editingText-to-Image Generation | CodeCode Available | 1 |
| LAFITE: Towards Language-Free Training for Text-to-Image Generation | Nov 27, 2021 | Image GenerationText to Image Generation | CodeCode Available | 1 |