PromptFix: You Prompt and We Fix the Photo May 27, 2024 Denoising Image Generation
Code Code Available 45 ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models Mar 4, 2024 Image Generation
Code Code Available 45 Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators Mar 23, 2023 Image Generation Text-to-Video Generation
Code Code Available 45 DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge Jul 6, 2025 Image Generation Multimodal Reasoning
Code Code Available 35 Behavior Generation with Latent Actions Mar 5, 2024 Autonomous Driving Decision Making
Code Code Available 35 PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360deg Jan 1, 2023 Image Generation Image Segmentation
Code Code Available 35 Personalized Image Generation with Deep Generative Models: A Decade Survey Feb 18, 2025 Image Generation Personalized Image Generation
Code Code Available 35 ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Feb 25, 2025 Image Generation
Code Code Available 35 PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360^ Mar 23, 2023 Image Generation Image Segmentation
Code Code Available 35 Personalize Segment Anything Model with One Shot May 4, 2023 Image Generation model
Code Code Available 35 DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis May 23, 2024 Image Generation Mamba
Code Code Available 35 Optimal Stepsize for Diffusion Sampling Mar 27, 2025 Denoising Image Generation
Code Code Available 35 DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation Feb 19, 2024 Image Generation
Code Code Available 35 Ovis-U1 Technical Report Jun 29, 2025 Image Generation Text to Image Generation
Code Code Available 35 On Noise Injection in Generative Adversarial Networks Jun 10, 2020 Image Generation
Code Code Available 35 Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding Oct 2, 2024 Image Generation Text to Image Generation
Code Code Available 35 Generating Long Sequences with Sparse Transformers Apr 23, 2019 Diversity Image Generation
Code Code Available 35 One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale Mar 12, 2023 All Image Generation
Code Code Available 35 On the Trajectory Regularity of ODE-based Diffusion Sampling May 18, 2024 Denoising Image Generation
Code Code Available 35 Paint by Example: Exemplar-based Image Editing with Diffusion Models Nov 23, 2022 Image Generation Image Manipulation
Code Code Available 35 On Distillation of Guided Diffusion Models Oct 6, 2022 Denoising Image Generation
Code Code Available 35 AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation Oct 8, 2024 Denoising Image Generation
Code Code Available 35 Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing Apr 30, 2025 Image Generation
Code Code Available 35 Multimodal Foundation Models: From Specialists to General-Purpose Assistants Sep 18, 2023 Image Generation Survey
Code Code Available 35 MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance Jun 11, 2024 Image Generation Text to Image Generation
Code Code Available 35 ModelScope Text-to-Video Technical Report Aug 12, 2023 Denoising Image Generation
Code Code Available 35 Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models Mar 24, 2025 4k Image Generation
Code Code Available 35 MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Apr 8, 2024 Image Generation Image-to-Image Translation
Code Code Available 35 MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost Dec 2, 2024 Image Generation
Code Code Available 35 Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis Oct 10, 2024 Feature Compression Image Generation
Code Code Available 35 MedSegDiff-V2: Diffusion based Medical Image Segmentation with Transformer Jan 19, 2023 Image Generation Image Segmentation
Code Code Available 35 MaskGIT: Masked Generative Image Transformer Feb 8, 2022 Decoder Image Generation
Code Code Available 35 Magic-Me: Identity-Specific Video Customized Diffusion Feb 14, 2024 Image Generation Text to Image Generation
Code Code Available 35 MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model Nov 1, 2022 Anomaly Detection Brain Tumor Segmentation
Code Code Available 35 MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation Feb 16, 2023 Image Generation Text to Image Generation
Code Code Available 35 One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt Jan 23, 2025 Image Generation Story Generation
Code Code Available 35 Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization Aug 28, 2023 Image Enhancement Image Generation
Code Code Available 35 DF40: Toward Next-Generation Deepfake Detection Jun 19, 2024 DeepFake Detection Face Reenactment
Code Code Available 35 DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing Mar 21, 2024 Image Generation spatial-aware image editing
Code Code Available 35 An Image is Worth 32 Tokens for Reconstruction and Generation Jun 11, 2024 Image Generation Image Reconstruction
Code Code Available 35 Designing a Better Asymmetric VQGAN for StableDiffusion Jun 7, 2023 Decoder Image Generation
Code Code Available 35 DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks Feb 24, 2025 Conditional Image Generation Image Generation
Code Code Available 35 LLMs can see and hear without any training Jan 30, 2025 Audio captioning Image Generation
Code Code Available 35 Deep Generative Models on 3D Representations: A Survey Oct 27, 2022 3D-Aware Image Synthesis 3D Shape Generation
Code Code Available 35 Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models Feb 7, 2024 counterfactual Image Generation
Code Code Available 35 Autoregressive Image Generation using Residual Quantization Mar 3, 2022 Conditional Image Generation Image Generation
Code Code Available 35 Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation Mar 3, 2025 3D Generation 3D Reconstruction
Code Code Available 35 DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models Feb 8, 2022 Diagnostic Image Captioning
Code Code Available 35 AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation Jun 3, 2024 Image Generation
Code Code Available 35 CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation Oct 12, 2024 Conditional Image Generation GPU
Code Code Available 35