Preliminary Explorations with GPT-4o(mni) Native Image Generation May 6, 2025 Image Generation multimodal generation
— Unverified 0Multimodal Benchmarking and Recommendation of Text-to-Image Generation Models May 6, 2025 Benchmarking Image Generation
Code Code Available 0Distribution-Conditional Generation: From Class Distribution to Creative Generation May 6, 2025 Image Generation
— Unverified 0Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning May 6, 2025 Image Generation
Code Code Available 4Safer Prompts: Reducing IP Risk in Visual Generative AI May 6, 2025 Image Generation Prompt Engineering
— Unverified 0Real-Time Person Image Synthesis Using a Flow Matching Model May 6, 2025 Image Generation Video Generation
Code Code Available 0Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation May 6, 2025 Image Generation Mamba
Code Code Available 1From Spaceborne to Airborne: SAR Image Synthesis Using Foundation Models for Multi-Scale Adaptation May 5, 2025 Image Generation
— Unverified 0Text to Image Generation and Editing: A Survey May 5, 2025 Image Generation Mamba
— Unverified 0MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation May 5, 2025 Image Generation Scene Generation
— Unverified 0Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities May 5, 2025 Image Generation Survey
Code Code Available 5Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction May 5, 2025 Image Generation multimodal interaction
Code Code Available 4Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models May 5, 2025 Image Generation
Code Code Available 0No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves May 5, 2025 Image Generation Representation Learning
Code Code Available 2Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset May 4, 2025 Image Generation Image-to-Image Translation
— Unverified 0Regression is all you need for medical image translation May 4, 2025 All Hallucination
Code Code Available 0RAGAR: Retrieval Augment Personalized Image Generation Guided by Recommendation May 3, 2025 Image Generation Personalized Image Generation
— Unverified 0Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling May 3, 2025 Image Generation Image Inpainting
— Unverified 0WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation May 2, 2025 Image Generation Text to Image Generation
— Unverified 0Improving Editability in Image Generation with Layer-wise Memory May 2, 2025 Disentanglement Image Generation
— Unverified 0JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers May 1, 2025 Depth Estimation Image Generation
— Unverified 0T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT May 1, 2025 Image Generation Reinforcement Learning (RL)
Code Code Available 4Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields Apr 30, 2025 Image Compression Image Generation
— Unverified 0Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing Apr 30, 2025 Image Generation
Code Code Available 3YoChameleon: Personalized Vision and Language Generation Apr 29, 2025 Image Generation Text Generation
— Unverified 0PixelHacker: Image Inpainting with Structural and Semantic Consistency Apr 29, 2025 Denoising Image Generation
Code Code Available 3Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion Apr 29, 2025 Action Generation FAD
— Unverified 0A Picture is Worth a Thousand Prompts? Efficacy of Iterative Human-Driven Prompt Refinement in Image Regeneration Tasks Apr 29, 2025 Image Generation
— Unverified 0Inception: Jailbreak the Memory Mechanism of Text-to-Image Generation Systems Apr 29, 2025 Image Generation Text to Image Generation
— Unverified 0Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions Apr 27, 2025 Image Generation Motion Synthesis
Code Code Available 2HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models Apr 25, 2025 Denoising Diagnostic
— Unverified 0DiffUMI: Training-Free Universal Model Inversion via Unconditional Diffusion for Face Recognition Apr 25, 2025 Face Generation Face Recognition
— Unverified 0RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Apr 24, 2025 Image Generation Text to Image Generation
— Unverified 0Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Apr 24, 2025 Image Generation Text Generation
— Unverified 0Fast Autoregressive Models for Continuous Latent Generation Apr 24, 2025 Denoising Image Generation
— Unverified 0DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition Apr 24, 2025 Disentanglement Image Generation
— Unverified 0FashionM3: Multimodal, Multitask, and Multiround Fashion Assistant based on Unified Vision-Language Model Apr 24, 2025 Image Generation Language Modeling
— Unverified 0ePBR: Extended PBR Materials in Image Synthesis Apr 23, 2025 Image Generation
— Unverified 0Distilling semantically aware orders for autoregressive image generation Apr 23, 2025 Image Generation Text Generation
— Unverified 0UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing Apr 22, 2025 Depth Estimation Image Generation
— Unverified 0FreeGraftor: Training-Free Cross-Image Feature Grafting for Subject-Driven Text-to-Image Generation Apr 22, 2025 Image Generation Text to Image Generation
Code Code Available 1Emergence and Evolution of Interpretable Concepts in Diffusion Models Apr 21, 2025 Image Generation Text to Image Generation
— Unverified 0Twin Co-Adaptive Dialogue for Progressive Image Generation Apr 21, 2025 Image Generation Text to Image Generation
— Unverified 0VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation Apr 21, 2025 Conditional Image Generation Depth Estimation
— Unverified 0Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration Apr 21, 2025 Image Generation Image Restoration
— Unverified 0TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models Apr 21, 2025 Image Generation Image Segmentation
— Unverified 0What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale Apr 21, 2025 image-classification Image Classification
— Unverified 0Causal Disentanglement for Robust Long-tail Medical Image Generation Apr 20, 2025 counterfactual Disentanglement
— Unverified 0Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens Apr 20, 2025 Attribute Image Generation
— Unverified 0REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models Apr 20, 2025 Attribute Image Generation
— Unverified 0