SEED-Bench-2: Benchmarking Multimodal Large Language Models Nov 28, 2023 Benchmarking Image Generation
Code Code Available 2Text-Driven Image Editing via Learnable Regions Nov 28, 2023 Image Generation
Code Code Available 2LLMGA: Multimodal Large Language Model based Generation Assistant Nov 27, 2023 Image Generation Language Modeling
Code Code Available 2Flow-Guided Diffusion for Video Inpainting Nov 26, 2023 Denoising Image Generation
Code Code Available 2MVControl: Adding Conditional Control to Multi-view Diffusion for Controllable Text-to-3D Generation Nov 24, 2023 3D Generation Image Generation
Code Code Available 2Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models Nov 22, 2023 Denoising Image Generation
Code Code Available 2The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Nov 16, 2023 Consistent Character Generation Image Generation
Code Code Available 2Matryoshka Diffusion Models Oct 23, 2023 Image Generation Zero-shot Generalization
Code Code Available 2A Pytorch Reproduction of Masked Generative Image Transformer Oct 22, 2023 Image Generation
Code Code Available 2LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation Oct 16, 2023 GPU Image Animation
Code Code Available 2PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm Oct 12, 2023 3D Object Detection 3D Reconstruction
Code Code Available 2ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Oct 11, 2023 Image Generation
Code Code Available 2DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model Oct 11, 2023 Autonomous Driving Image Generation
Code Code Available 2Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models Oct 11, 2023 Code Generation Image Generation
Code Code Available 2Aligning Text-to-Image Diffusion Models with Reward Backpropagation Oct 5, 2023 Denoising Image Generation
Code Code Available 2MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens Oct 3, 2023 Image Generation multimodal generation
Code Code Available 2Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code Oct 2, 2023 Image Generation Text-based Image Editing
Code Code Available 2Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion Oct 1, 2023 Denoising Image Generation
Code Code Available 2InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists Sep 30, 2023 Depth Estimation Image Generation
Code Code Available 2Denoising Diffusion Bridge Models Sep 29, 2023 Denoising Image Generation
Code Code Available 2Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning Sep 5, 2023 Decoder Image Generation
Code Code Available 2Relay Diffusion: Unifying diffusion process across resolutions for image synthesis Sep 4, 2023 Image Generation
Code Code Available 2Residual Denoising Diffusion Models Aug 25, 2023 Denoising Diversity
Code Code Available 2Dense Text-to-Image Generation with Attention Modulation Aug 24, 2023 Image Generation Text to Image Generation
Code Code Available 2Bayesian Flow Networks Aug 14, 2023 Bayesian Inference Data Compression
Code Code Available 2DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models Aug 11, 2023 Dataset Generation Decoder
Code Code Available 2Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow Aug 11, 2023 Denoising Image Generation
Code Code Available 2ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints Aug 3, 2023 Image Generation Language Modelling
Code Code Available 2A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models Jul 24, 2023 Image Generation Image-text matching
Code Code Available 2Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning Jul 21, 2023 Diffusion Personalization Diffusion Personalization Tuning Free
Code Code Available 2BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion Jul 20, 2023 Conditional Text-to-Image Synthesis Denoising
Code Code Available 2Flow Matching in Latent Space Jul 17, 2023 Computational Efficiency Image Generation
Code Code Available 2Planting a SEED of Vision in Large Language Model Jul 16, 2023 Image Generation Image to text
Code Code Available 2T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation Jul 12, 2023 Attribute Image Generation
Code Code Available 2SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Jul 4, 2023 Image Generation
Code Code Available 2MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion Jul 3, 2023 Image Generation
Code Code Available 2DreamDiffusion: Generating High-Quality Images from Brain EEG Signals Jun 29, 2023 EEG Electroencephalogram (EEG)
Code Code Available 2Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis Jun 15, 2023 Image Generation Preference Mapping
Code Code Available 2Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models Jun 7, 2023 Diversity Image Generation
Code Code Available 2VideoComposer: Compositional Video Synthesis with Motion Controllability Jun 3, 2023 Image Generation Text-to-Video Generation
Code Code Available 2StyleDrop: Text-to-Image Generation in Any Style Jun 1, 2023 Image Generation Text to Image Generation
Code Code Available 2Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models Jun 1, 2023 Image Generation Story Visualization
Code Code Available 2Differential Diffusion: Giving Each Pixel Its Strength Jun 1, 2023 Image Generation Text-based Image Editing
Code Code Available 2ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation Jun 1, 2023 Image Generation Text to Image Generation
Code Code Available 2STEVE-1: A Generative Model for Text-to-Behavior in Minecraft Jun 1, 2023 Decision Making Image Generation
Code Code Available 2Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models Jun 1, 2023 GPU Image Compression
Code Code Available 2Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust May 31, 2023 Image Generation
Code Code Available 2Cones 2: Customizable Image Synthesis with Multiple Subjects May 30, 2023 Image Generation
Code Code Available 2GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction May 30, 2023 Image Generation Instruction Following
Code Code Available 2Conditional Diffusion Models for Semantic 3D Brain MRI Synthesis May 29, 2023 Data Augmentation Image Generation
Code Code Available 2