MV-Adapter: Multi-view Consistent Image Generation Made Easy | Dec 4, 2024 | 3D Generation Image Generation | Code available | 5
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models | Nov 20, 2024 | Benchmarking Image Generation | Code available | 5
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models | Nov 7, 2024 | Image Generation | Code available | 5
Randomized Autoregressive Visual Generation | Nov 1, 2024 | Image Generation Language Modeling | Code available | 5
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification | Oct 14, 2024 | Image Generation | Code available | 5
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | Oct 9, 2024 | Attribute Image Generation | Code available | 5
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think | Oct 9, 2024 | Denoising Image Generation | Code available | 5
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation | Aug 22, 2024 | 10-shot image generation | Code available | 5
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models | Jul 21, 2024 | All Fashion Synthesis | Code available | 5
IMAGDressing-v1: Customizable Virtual Dressing | Jul 17, 2024 | Denoising Image Generation | Code available | 5
Autoregressive Image Generation without Vector Quantization | Jun 17, 2024 | Image Generation Quantization | Code available | 5
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts | Jun 13, 2024 | Conditional Image Generation Image Generation | Code available | 5
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks | Jun 12, 2024 | Image Generation Language Modeling | Code available | 5
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Jun 10, 2024 | Conditional Image Generation Image Generation | Code available | 5
Improved Distribution Matching Distillation for Fast Image Synthesis | May 23, 2024 | Image Generation | Code available | 5
Diffusion for World Modeling: Visual Details Matter in Atari | May 20, 2024 | Image Generation reinforcement-learning | Code available | 5
Magic Clothing: Controllable Garment-Driven Image Synthesis | Apr 15, 2024 | Image Generation | Code available | 5
Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Mar 12, 2024 | Image Generation Language Modelling | Code available | 5
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | Mar 8, 2024 | Denoising Image Generation | Code available | 5
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion | Mar 8, 2024 | Computational Efficiency Image Generation | Code available | 5
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Mar 7, 2024 | 4k Image Captioning | Code available | 5
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs | Jan 22, 2024 | Diffusion Personalization Tuning Free Image Generation | Code available | 5
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers | Jan 16, 2024 | Image Generation | Code available | 5
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models | Aug 13, 2023 | Diffusion Personalization Tuning Free Image Generation | Code available | 5
Image Vectorization: a Review | Jun 10, 2023 | Image Generation Vector Graphics | Code available | 5
Consistency Models | Mar 2, 2023 | Colorization Image Generation | Code available | 5
Scalable Diffusion Models with Transformers | Dec 19, 2022 | Image Generation | Code available | 5
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models | Nov 2, 2022 | Image Generation Text to Image Generation | Code available | 5
DreamFusion: Text-to-3D using 2D Diffusion | Sep 29, 2022 | Denoising Image Generation | Code available | 5
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | Sep 19, 2022 | Decoder Image Generation | Code available | 5
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | Aug 25, 2022 | Diffusion Personalization Image Generation | Code available | 5
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion | Aug 2, 2022 | Image Generation Personalized Image Generation | Code available | 5
XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation | Jun 26, 2025 | Attribute Image Generation | Code available | 4
Ming-Omni: A Unified Multimodal Model for Perception and Generation | Jun 11, 2025 | Image Generation text-to-speech | Code available | 4
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation | Jun 3, 2025 | Image Editing | Code available | 4
ImgEdit: A Unified Image Editing Dataset and Benchmark | May 26, 2025 | Image Editing | Code available | 4
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO | May 22, 2025 | Domain Generalization Image Generation | Code available | 4
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning | May 6, 2025 | Image Generation | Code available | 4
Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction | May 5, 2025 | Image Generation multimodal interaction | Code available | 4
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT | May 1, 2025 | Image Generation Reinforcement Learning (RL) | Code available | 4
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework | Mar 27, 2025 | Image Generation Text to Image Generation | Code available | 4
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation | Mar 10, 2025 | Common Sense Reasoning Image Generation | Code available | 4
Unified Reward Model for Multimodal Understanding and Generation | Mar 7, 2025 | Image Generation model | Code available | 4
Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step | Jan 23, 2025 | Image Generation Text-to-Image Generation | Code available | 4
A New Formulation of Lipschitz Constrained With Functional Gradient Learning for GANs | Jan 20, 2025 | Diversity Image Generation | Code available | 4
The GAN is dead; long live the GAN! A Modern GAN Baseline | Jan 9, 2025 | Image Generation | Code available | 4
Autoregressive Video Generation without Vector Quantization | Dec 18, 2024 | Image Generation Prediction | Code available | 4
Taming Scalable Visual Tokenizer for Autoregressive Image Generation | Dec 3, 2024 | Image Generation Image Reconstruction | Code available | 4
One Diffusion to Generate Them All | Nov 25, 2024 | All Camera Pose Estimation | Code available | 4
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement | Nov 10, 2024 | Attribute Image Generation | Code available | 4