StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation Sep 19, 2024 Image Generation Personalized Image Generation
The GAN is dead; long live the GAN! A Modern GAN Baseline Jan 9, 2025 Image Generation
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis Jun 19, 2022 Generative Adversarial Network Image Generation
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge Jul 6, 2025 Image Generation Multimodal Reasoning
Behavior Generation with Latent Actions Mar 5, 2024 Autonomous Driving Decision Making
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° Jan 1, 2023 Image Generation Image Segmentation
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° Mar 23, 2023 Image Generation Image Segmentation
Personalized Image Generation with Deep Generative Models: A Decade Survey Feb 18, 2025 Image Generation Personalized Image Generation
Paint by Example: Exemplar-based Image Editing with Diffusion Models Nov 23, 2022 Image Generation Image Manipulation
Personalize Segment Anything Model with One Shot May 4, 2023 Image Generation model
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis May 23, 2024 Image Generation Mamba
On the Trajectory Regularity of ODE-based Diffusion Sampling May 18, 2024 Denoising Image Generation
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation Feb 19, 2024 Image Generation
Optimal Stepsize for Diffusion Sampling Mar 27, 2025 Denoising Image Generation
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt Jan 23, 2025 Image Generation Story Generation
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale Mar 12, 2023 All Image Generation
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding Oct 2, 2024 Image Generation Text to Image Generation
Generating Long Sequences with Sparse Transformers Apr 23, 2019 Diversity Image Generation
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation Jun 3, 2024 Image Generation
On Noise Injection in Generative Adversarial Networks Jun 10, 2020 Image Generation
Ovis-U1 Technical Report Jun 29, 2025 Image Generation Text to Image Generation
Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing Apr 30, 2025 Image Generation
Autoregressive Image Generation using Residual Quantization Mar 3, 2022 Conditional Image Generation Image Generation
On Distillation of Guided Diffusion Models Oct 6, 2022 Denoising Image Generation
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance Jun 11, 2024 Image Generation Text to Image Generation
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models Mar 24, 2025 4k Image Generation
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost Dec 2, 2024 Image Generation
ModelScope Text-to-Video Technical Report Aug 12, 2023 Denoising Image Generation
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Apr 8, 2024 Image Generation Image-to-Image Translation
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation Feb 16, 2023 Image Generation Text to Image Generation
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance Dec 17, 2024 Image Generation Object
Attention Distillation: A Unified Approach to Visual Characteristics Transfer Feb 27, 2025 Denoising Image Generation
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis Oct 10, 2024 Feature Compression Image Generation
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model Nov 1, 2022 Anomaly Detection Brain Tumor Segmentation
MaskGIT: Masked Generative Image Transformer Feb 8, 2022 Decoder Image Generation
MedSegDiff-V2: Diffusion based Medical Image Segmentation with Transformer Jan 19, 2023 Image Generation Image Segmentation
Multimodal Foundation Models: From Specialists to General-Purpose Assistants Sep 18, 2023 Image Generation Survey
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization Aug 28, 2023 Image Enhancement Image Generation
Designing a Better Asymmetric VQGAN for StableDiffusion Jun 7, 2023 Decoder Image Generation
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing Mar 21, 2024 Image Generation spatial-aware image editing
LLMs can see and hear without any training Jan 30, 2025 Audio captioning Image Generation
DF40: Toward Next-Generation Deepfake Detection Jun 19, 2024 DeepFake Detection Face Reenactment
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation Mar 3, 2025 3D Generation 3D Reconstruction
Deep Generative Models on 3D Representations: A Survey Oct 27, 2022 3D-Aware Image Synthesis 3D Shape Generation
DDT: Decoupled Diffusion Transformer Apr 8, 2025 Denoising Image Generation
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models Feb 8, 2022 Diagnostic Image Captioning
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Feb 25, 2025 Image Generation
Deciphering Oracle Bone Language with Diffusion Models Jun 2, 2024 Decipherment Image Generation
Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework Oct 28, 2024 Image Generation Image Manipulation
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens Jun 20, 2025 Image Generation Multimodal Reasoning