Autoregressive Models in Vision: A Survey Nov 8, 2024 3D Generation Image Generation
Code Code Available 4Taming Rectified Flow for Inversion and Editing Nov 7, 2024 Image Generation Text-to-Image Generation
Code Code Available 4Training-free Regional Prompting for Diffusion Transformers Nov 4, 2024 Image Generation Text to Image Generation
Code Code Available 4When Does Perceptual Alignment Benefit Vision Representations? Oct 14, 2024 Depth Estimation Image Generation
Code Code Available 4Story-Adapter: A Training-free Iterative Framework for Long Story Visualization Oct 8, 2024 Image Generation Story Visualization
Code Code Available 4Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction Sep 26, 2024 3D Reconstruction Denoising
Code Code Available 4StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation Sep 19, 2024 Image Generation Personalized Image Generation
Code Code Available 4Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think Sep 17, 2024 Conditional Image Generation Depth Estimation
Code Code Available 4Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation Sep 6, 2024 Image Generation Image Reconstruction
Code Code Available 4SEED-Story: Multimodal Long Story Generation with Large Language Model Jul 11, 2024 Image Generation Language Modeling
Code Code Available 4MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis Jul 2, 2024 Attribute Image Generation
Code Code Available 4Guiding a Diffusion Model with a Bad Version of Itself Jun 4, 2024 Image Generation
Code Code Available 4Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Jun 4, 2024 Face Swapping GPU
Code Code Available 4Phased Consistency Models May 28, 2024 Image Generation Video Generation
Code Code Available 4PromptFix: You Prompt and We Fix the Photo May 27, 2024 Denoising Image Generation
Code Code Available 4SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation Apr 22, 2024 Image Generation
Code Code Available 4Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Apr 15, 2024 Image Generation Image Restoration
Code Code Available 4SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Mar 25, 2024 Decoder GPU
Code Code Available 4Long-CLIP: Unlocking the Long-Text Capability of CLIP Mar 22, 2024 Image Generation Image Retrieval
Code Code Available 4OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models Mar 16, 2024 Denoising Image Generation
Code Code Available 4SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models Mar 14, 2024 Blocking GPU
Code Code Available 4ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models Mar 4, 2024 Image Generation
Code Code Available 4Diffusion Model-Based Image Editing: A Survey Feb 27, 2024 Denoising Image Generation
Code Code Available 4MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis Feb 8, 2024 Attribute Conditional Text-to-Image Synthesis
Code Code Available 4InstanceDiffusion: Instance-level Control for Image Generation Feb 5, 2024 Conditional Text-to-Image Synthesis Image Generation
Code Code Available 4DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing Feb 4, 2024 Image Generation
Code Code Available 4AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data Feb 1, 2024 Conditional Image Generation Denoising
Code Code Available 4Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications Jan 11, 2024 image-classification Image Classification
Code Code Available 4DemoFusion: Democratising High-Resolution Image Generation With No $ Nov 24, 2023 Image Generation
Code Code Available 4LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Nov 9, 2023 GPU Image Generation
Code Code Available 4AnyText: Multilingual Visual Text Generation And Editing Nov 6, 2023 Image Generation Optical Character Recognition (OCR)
Code Code Available 4LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing Nov 1, 2023 All Image Generation
Code Code Available 4A Survey on Video Diffusion Models Oct 16, 2023 Image Generation Survey
Code Code Available 4Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation Oct 9, 2023 Action Recognition Image Generation
Code Code Available 4Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference Oct 6, 2023 GPU Image Generation
Code Code Available 4Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion Oct 5, 2023 Image Generation Text to Image Generation
Code Code Available 4PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis Sep 30, 2023 GPU
Code Code Available 4Vision + Language Applications: A Survey May 24, 2023 Image Generation Survey
Code Code Available 4Token Merging for Fast Stable Diffusion Mar 30, 2023 Image Generation
Code Code Available 4Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators Mar 23, 2023 Image Generation Text-to-Video Generation
Code Code Available 4VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation Mar 15, 2023 Code Generation Denoising
Code Code Available 4T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models Feb 16, 2023 Image Generation Style Transfer
Code Code Available 43D-aware Conditional Image Synthesis Feb 16, 2023 Image Generation
Code Code Available 4ArchiSound: Audio Generation with Diffusion Jan 30, 2023 Audio Generation GPU
Code Code Available 4Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion Jan 27, 2023 GPU Image Generation
Code Code Available 4GLIGEN: Open-Set Grounded Text-to-Image Generation Jan 17, 2023 Conditional Text-to-Image Synthesis Image Generation
Code Code Available 4Null-text Inversion for Editing Real Images using Guided Diffusion Models Nov 17, 2022 Image Generation Text-based Image Editing
Code Code Available 4Diffusion Models: A Comprehensive Survey of Methods and Applications Sep 2, 2022 Image Generation Image Super-Resolution
Code Code Available 4Prompt-to-Prompt Image Editing with Cross Attention Control Aug 2, 2022 Image Generation Text-based Image Editing
Code Code Available 4StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis Jun 19, 2022 Generative Adversarial Network Image Generation
Code Code Available 4