No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves May 5, 2025 Image Generation Representation Learning
Code Code Available 2Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions Apr 27, 2025 Image Generation Motion Synthesis
Code Code Available 2Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off Apr 17, 2025 Garment Reconstruction Image Generation
Code Code Available 2Flux Already Knows -- Activating Subject-Driven Image Generation without Training Apr 12, 2025 Image Generation Virtual Try-on
Code Code Available 2OmniCaptioner: One Captioner to Rule Them All Apr 9, 2025 All Image Captioning
Code Code Available 2HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance Apr 8, 2025 Image Generation
Code Code Available 2Gaussian Mixture Flow Matching Models Apr 7, 2025 Denoising Image Generation
Code Code Available 2UniToken: Harmonizing Multimodal Understanding and Generation through Unified Visual Encoding Apr 6, 2025 Image Generation
Code Code Available 2ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement Apr 2, 2025 Decoder Image Generation
Code Code Available 2TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Mar 30, 2025 2k Image Generation
Code Code Available 2Harmonizing Visual Representations for Unified Multimodal Understanding and Generation Mar 27, 2025 Image Generation Quantization
Code Code Available 2Unified Multimodal Discrete Diffusion Mar 26, 2025 Image Captioning Image Generation
Code Code Available 2Scaling Down Text Encoders of Text-to-Image Diffusion Models Mar 25, 2025 GPU Image Generation
Code Code Available 2Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing Mar 25, 2025 Image Dehazing Image Generation
Code Code Available 2Single Image Iterative Subject-driven Generation and Editing Mar 20, 2025 Image Generation
Code Code Available 2Ultra-Resolution Adaptation with Ease Mar 20, 2025 2k 4k
Code Code Available 2Tokenize Image as a Set Mar 20, 2025 Image Generation
Code Code Available 2GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching Mar 17, 2025 Autonomous Driving Image Generation
Code Code Available 2Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection Mar 15, 2025 Image Generation Text to Image Generation
Code Code Available 2Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards Mar 14, 2025 Denoising Image Generation
Code Code Available 2Autoregressive Image Generation with Randomized Parallel Decoding Mar 13, 2025 Conditional Image Generation Image Generation
Code Code Available 2Neighboring Autoregressive Modeling for Efficient Visual Generation Mar 12, 2025 Image Generation Text to Image Generation
Code Code Available 2LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization Mar 11, 2025 GPU Image Generation
Code Code Available 2Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Mar 10, 2025 Image Description Image Generation
Code Code Available 2Learning Few-Step Diffusion Models by Trajectory Distribution Matching Mar 9, 2025 Image Generation Text to Image Generation
Code Code Available 2X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation Mar 8, 2025 GPU Image Generation
Code Code Available 2USP: Unified Self-Supervised Pretraining for Image Generation and Understanding Mar 8, 2025 Image Generation Representation Learning
Code Code Available 2Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator Mar 3, 2025 Image Generation
Code Code Available 2Geodesic Diffusion Models for Medical Image-to-Image Generation Mar 2, 2025 Denoising Image Denoising
Code Code Available 2Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality Mar 1, 2025 Image Enhancement Image Generation
Code Code Available 2MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing Feb 28, 2025 Image Generation Transfer Learning
Code Code Available 2Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Feb 27, 2025 Image Generation Text to Image Generation
Code Code Available 2FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction Feb 27, 2025 Image Generation Prediction
Code Code Available 2Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions Feb 24, 2025 Data Augmentation Image Generation
Code Code Available 2CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation Feb 18, 2025 Image Generation Text to Image Generation
Code Code Available 2Diffusion Models without Classifier-free Guidance Feb 17, 2025 Conditional Image Generation Image Generation
Code Code Available 2TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation Feb 11, 2025 Image Generation
Code Code Available 2Differentially Private Synthetic Data via APIs 3: Using Simulators Instead of Foundation Model Feb 8, 2025 Image Generation
Code Code Available 2CTR-Driven Advertising Image Generation with Multimodal Large Language Models Feb 5, 2025 Image Generation Reinforcement Learning (RL)
Code Code Available 2On the Guidance of Flow Matching Feb 4, 2025 Decision Making Image Generation
Code Code Available 2Compressed Image Generation with Denoising Diffusion Codebook Models Feb 3, 2025 Conditional Image Generation Denoising
Code Code Available 2VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking Jan 24, 2025 Denoising Image Generation
Code Code Available 2AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Jan 16, 2025 Image Generation Text to Image Generation
Code Code Available 2AutoPresent: Designing Structured Visuals from Scratch Jan 1, 2025 Image Generation
Code Code Available 2Dual Diffusion for Unified Image Generation and Understanding Dec 31, 2024 Image Generation Language Modeling
Code Code Available 2VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control Dec 30, 2024 Denoising Image Generation
Code Code Available 2EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation Dec 24, 2024 Image Captioning Image Generation
Code Code Available 2Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching Dec 22, 2024 Image Generation Text to Image Generation
Code Code Available 2Personalized Representation from Personalized Generation Dec 20, 2024 Contrastive Learning Image Generation
Code Code Available 2Next Patch Prediction for Autoregressive Visual Generation Dec 19, 2024 Image Generation Prediction
Code Code Available 2