T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models Feb 16, 2023 Image Generation Style Transfer
Code Code Available 4Story-Adapter: A Training-free Iterative Framework for Long Story Visualization Oct 8, 2024 Image Generation Story Visualization
Code Code Available 4SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Mar 25, 2024 Decoder GPU
Code Code Available 4SEED-Story: Multimodal Long Story Generation with Large Language Model Jul 11, 2024 Image Generation Language Modeling
Code Code Available 4ImgEdit: A Unified Image Editing Dataset and Benchmark May 26, 2025 Image Editing
Code Code Available 4SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation Apr 22, 2024 Image Generation
Code Code Available 4ControlVAE: Tuning, Analytical Properties, and Performance Analysis Oct 31, 2020 Disentanglement Image Generation
Code Code Available 4InstanceDiffusion: Instance-level Control for Image Generation Feb 5, 2024 Conditional Text-to-Image Synthesis Image Generation
Code Code Available 4StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation Sep 19, 2024 Image Generation Personalized Image Generation
Code Code Available 4T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT May 1, 2025 Image Generation Reinforcement Learning (RL)
Code Code Available 4ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models Mar 4, 2024 Image Generation
Code Code Available 4Guiding a Diffusion Model with a Bad Version of Itself Jun 4, 2024 Image Generation
Code Code Available 4Taming Scalable Visual Tokenizer for Autoregressive Image Generation Dec 3, 2024 Image Generation Image Reconstruction
Code Code Available 4VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation Mar 15, 2023 Code Generation Denoising
Code Code Available 4High-Resolution Image Synthesis with Latent Diffusion Models Dec 20, 2021 Denoising GPU
Code Code Available 4The GAN is dead; long live the GAN! A Modern GAN Baseline Jan 9, 2025 Image Generation
Code Code Available 4Training-free Regional Prompting for Diffusion Transformers Nov 4, 2024 Image Generation Text to Image Generation
Code Code Available 4Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion Oct 5, 2023 Image Generation Text to Image Generation
Code Code Available 4AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data Feb 1, 2024 Conditional Image Generation Denoising
Code Code Available 4A New Formulation of Lipschitz Constrained With Functional Gradient Learning for GANs Jan 20, 2025 Diversity Image Generation
Code Code Available 4GLIGEN: Open-Set Grounded Text-to-Image Generation Jan 17, 2023 Conditional Text-to-Image Synthesis Image Generation
Code Code Available 4Prompt-to-Prompt Image Editing with Cross Attention Control Aug 2, 2022 Image Generation Text-based Image Editing
Code Code Available 4Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Apr 15, 2024 Image Generation Image Restoration
Code Code Available 4PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis Sep 30, 2023 GPU
Code Code Available 4Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction Sep 26, 2024 3D Reconstruction Denoising
Code Code Available 4Long-CLIP: Unlocking the Long-Text Capability of CLIP Mar 22, 2024 Image Generation Image Retrieval
Code Code Available 4PromptFix: You Prompt and We Fix the Photo May 27, 2024 Denoising Image Generation
Code Code Available 4Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement Nov 10, 2024 Attribute Image Generation
Code Code Available 4Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Jun 4, 2024 Face Swapping GPU
Code Code Available 4Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation Sep 6, 2024 Image Generation Image Reconstruction
Code Code Available 4Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step Jan 23, 2025 Image Generation Text-to-Image Generation
Code Code Available 4One Diffusion to Generate Them All Nov 25, 2024 All Camera Pose Estimation
Code Code Available 4Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications Jan 11, 2024 image-classification Image Classification
Code Code Available 4Elucidating the Design Space of Diffusion-Based Generative Models Jun 1, 2022 Image Generation
Code Code Available 4Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Sep 17, 2024 Conditional Image Generation Depth Estimation
Code Code Available 4OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models Mar 16, 2024 Denoising Image Generation
Code Code Available 4Phased Consistency Models May 28, 2024 Image Generation Video Generation
Code Code Available 4Diffusion Models: A Comprehensive Survey of Methods and Applications Sep 2, 2022 Image Generation Image Super-Resolution
Code Code Available 4Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion Jan 27, 2023 GPU Image Generation
Code Code Available 4Diffusion Model-Based Image Editing: A Survey Feb 27, 2024 Denoising Image Generation
Code Code Available 4Autoregressive Models in Vision: A Survey Nov 8, 2024 3D Generation Image Generation
Code Code Available 4Autoregressive Video Generation without Vector Quantization Dec 18, 2024 Image Generation Prediction
Code Code Available 4MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis Jul 2, 2024 Attribute Image Generation
Code Code Available 4MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis Feb 8, 2024 Attribute Conditional Text-to-Image Synthesis
Code Code Available 4DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing Feb 4, 2024 Image Generation
Code Code Available 4Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction May 5, 2025 Image Generation multimodal interaction
Code Code Available 4A Survey on Video Diffusion Models Oct 16, 2023 Image Generation Survey
Code Code Available 43D-aware Conditional Image Synthesis Feb 16, 2023 Image Generation
Code Code Available 4Ming-Omni: A Unified Multimodal Model for Perception and Generation Jun 11, 2025 Image Generation text-to-speech
Code Code Available 4Null-text Inversion for Editing Real Images using Guided Diffusion Models Nov 17, 2022 Image Generation Text-based Image Editing
Code Code Available 4