InstanceDiffusion: Instance-level Control for Image Generation Feb 5, 2024 Conditional Text-to-Image Synthesis Image Generation
Code Code Available 45 T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models Feb 16, 2023 Image Generation Style Transfer
Code Code Available 45 ImgEdit: A Unified Image Editing Dataset and Benchmark May 26, 2025 Image Editing
Code Code Available 45 SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Mar 25, 2024 Decoder GPU
Code Code Available 45 SEED-Story: Multimodal Long Story Generation with Large Language Model Jul 11, 2024 Image Generation Language Modeling
Code Code Available 45 SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation Apr 22, 2024 Image Generation
Code Code Available 45 Story-Adapter: A Training-free Iterative Framework for Long Story Visualization Oct 8, 2024 Image Generation Story Visualization
Code Code Available 45 T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT May 1, 2025 Image Generation Reinforcement Learning (RL)
Code Code Available 45 ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models Mar 4, 2024 Image Generation
Code Code Available 45 Guiding a Diffusion Model with a Bad Version of Itself Jun 4, 2024 Image Generation
Code Code Available 45 Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion Oct 5, 2023 Image Generation Text to Image Generation
Code Code Available 45 Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference Oct 6, 2023 GPU Image Generation
Code Code Available 45 Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement Nov 10, 2024 Attribute Image Generation
Code Code Available 45 Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators Mar 23, 2023 Image Generation Text-to-Video Generation
Code Code Available 45 High-Resolution Image Synthesis with Latent Diffusion Models Dec 20, 2021 Denoising GPU
Code Code Available 45 ArchiSound: Audio Generation with Diffusion Jan 30, 2023 Audio Generation GPU
Code Code Available 45 Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation Oct 9, 2023 Action Recognition Image Generation
Code Code Available 45 Training-free Regional Prompting for Diffusion Transformers Nov 4, 2024 Image Generation Text to Image Generation
Code Code Available 45 PromptFix: You Prompt and We Fix the Photo May 27, 2024 Denoising Image Generation
Code Code Available 45 Prompt-to-Prompt Image Editing with Cross Attention Control Aug 2, 2022 Image Generation Text-based Image Editing
Code Code Available 45 GLIGEN: Open-Set Grounded Text-to-Image Generation Jan 17, 2023 Conditional Text-to-Image Synthesis Image Generation
Code Code Available 45 LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Nov 9, 2023 GPU Image Generation
Code Code Available 45 Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Apr 15, 2024 Image Generation Image Restoration
Code Code Available 45 AnyText: Multilingual Visual Text Generation And Editing Nov 6, 2023 Image Generation Optical Character Recognition (OCR)
Code Code Available 45 Long-CLIP: Unlocking the Long-Text Capability of CLIP Mar 22, 2024 Image Generation Image Retrieval
Code Code Available 45 A New Formulation of Lipschitz Constrained With Functional Gradient Learning for GANs Jan 20, 2025 Diversity Image Generation
Code Code Available 45 PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis Sep 30, 2023 GPU
Code Code Available 45 Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Sep 17, 2024 Conditional Image Generation Depth Estimation
Code Code Available 45 Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation Sep 6, 2024 Image Generation Image Reconstruction
Code Code Available 45 Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Jun 4, 2024 Face Swapping GPU
Code Code Available 45 Elucidating the Design Space of Diffusion-Based Generative Models Jun 1, 2022 Image Generation
Code Code Available 45 Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications Jan 11, 2024 image-classification Image Classification
Code Code Available 45 One Diffusion to Generate Them All Nov 25, 2024 All Camera Pose Estimation
Code Code Available 45 Phased Consistency Models May 28, 2024 Image Generation Video Generation
Code Code Available 45 Diffusion Models: A Comprehensive Survey of Methods and Applications Sep 2, 2022 Image Generation Image Super-Resolution
Code Code Available 45 Null-text Inversion for Editing Real Images using Guided Diffusion Models Nov 17, 2022 Image Generation Text-based Image Editing
Code Code Available 45 AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data Feb 1, 2024 Conditional Image Generation Denoising
Code Code Available 45 Diffusion Model-Based Image Editing: A Survey Feb 27, 2024 Denoising Image Generation
Code Code Available 45 Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion Jan 27, 2023 GPU Image Generation
Code Code Available 45 OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models Mar 16, 2024 Denoising Image Generation
Code Code Available 45 Autoregressive Models in Vision: A Survey Nov 8, 2024 3D Generation Image Generation
Code Code Available 45 Autoregressive Video Generation without Vector Quantization Dec 18, 2024 Image Generation Prediction
Code Code Available 45 MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis Jul 2, 2024 Attribute Image Generation
Code Code Available 45 DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing Feb 4, 2024 Image Generation
Code Code Available 45 MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis Feb 8, 2024 Attribute Conditional Text-to-Image Synthesis
Code Code Available 45 Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction May 5, 2025 Image Generation multimodal interaction
Code Code Available 45 A Survey on Video Diffusion Models Oct 16, 2023 Image Generation Survey
Code Code Available 45 3D-aware Conditional Image Synthesis Feb 16, 2023 Image Generation
Code Code Available 45 Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step Jan 23, 2025 Image Generation Text-to-Image Generation
Code Code Available 45 Ming-Omni: A Unified Multimodal Model for Perception and Generation Jun 11, 2025 Image Generation text-to-speech
Code Code Available 45