StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation (May 26, 2025). Tags: Image Generation, Instruction Following. Status: Unverified.
MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models (May 26, 2025). Tags: Image Generation, Visual Question Answering (VQA). Status: Unverified.
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities (May 26, 2025). Tags: Image Generation. Status: Unverified.
Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning (May 26, 2025). Tags: Classification, image-classification. Status: Unverified.
Training-free Stylized Text-to-Image Generation with Fast Inference (May 25, 2025). Tags: Image Generation, Text to Image Generation. Status: Unverified.
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving (May 25, 2025). Tags: Autonomous Driving, Image Generation. Status: Unverified.
Plug-and-Play Context Feature Reuse for Efficient Masked Generation (May 25, 2025). Tags: Image Generation. Status: Unverified.
RAISE: Realness Assessment for Image Synthesis and Evaluation (May 25, 2025). Tags: Image Generation. Status: Code Available.
Towards Understanding the Mechanisms of Classifier-Free Guidance (May 25, 2025). Tags: Image Generation. Status: Unverified.
TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis (May 25, 2025). Tags: CPU, GPU. Status: Unverified.
Mod-Adapter: Tuning-Free and Versatile Multi-concept Personalization via Modulation Adapter (May 24, 2025). Tags: Image Generation, Mixture-of-Experts. Status: Unverified.
How to build a consistency model: Learning flow maps via self-distillation (May 24, 2025). Tags: Image Generation. Status: Unverified.
Test-Time Scaling of Diffusion Models via Noise Trajectory Search (May 24, 2025). Tags: Denoising, Image Generation. Status: Code Available.
Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking (May 24, 2025). Tags: Image Generation, Language Modelling. Status: Unverified.
TNG-CLIP: Training-Time Negation Data Generation for Negation Awareness of CLIP (May 24, 2025). Tags: Image Captioning, Image Generation. Status: Unverified.
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation (May 24, 2025). Tags: Image Generation, Text to Image Generation. Status: Code Available.
RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration (May 23, 2025). Tags: All, Denoising. Status: Unverified.
F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles (May 23, 2025). Tags: Dataset Generation, Image Generation. Status: Code Available.
MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation (May 23, 2025). Tags: Audio Generation, Benchmarking. Status: Unverified.
MRI Image Generation Based on Text Prompts (May 23, 2025). Tags: Image Generation, MS-SSIM. Status: Unverified.
FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving (May 23, 2025). Tags: Autonomous Driving, Image Generation. Status: Unverified.
CONCORD: Concept-Informed Diffusion for Dataset Distillation (May 23, 2025). Tags: Computational Efficiency, Dataset Distillation. Status: Code Available.
TensorAR: Refinement is All You Need in Autoregressive Image Generation (May 22, 2025). Tags: All, Image Generation. Status: Unverified.
NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment (May 22, 2025). Tags: Image Generation, Image Restoration. Status: Unverified.
Creatively Upscaling Images with Global-Regional Priors (May 22, 2025). Tags: Denoising, Descriptive. Status: Unverified.
FPQVAR: Floating Point Quantization for Visual Autoregressive Model with FPGA Hardware Co-design (May 22, 2025). Tags: GPU, Image Generation. Status: Code Available.
Conditional Panoramic Image Generation via Masked Autoregressive Modeling (May 22, 2025). Tags: ERP, Image Generation. Status: Unverified.
Self-Rewarding Large Vision-Language Models for Optimizing Prompts in Text-to-Image Generation (May 22, 2025). Tags: Image Generation, Text to Image Generation. Status: Unverified.
Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation (May 21, 2025). Tags: Image Generation, Text to Image Generation. Status: Unverified.
MMaDA: Multimodal Large Diffusion Language Models (May 21, 2025). Tags: Image Generation, Reinforcement Learning (RL). Status: Code Available.
IA-T2I: Internet-Augmented Text-to-Image Generation (May 21, 2025). Tags: Image Generation, Image Retrieval. Status: Unverified.
Contrastive Learning-Enhanced Trajectory Matching for Small-Scale Dataset Distillation (May 21, 2025). Tags: Contrastive Learning, Dataset Distillation. Status: Unverified.
Generative AI for Autonomous Driving: A Review (May 21, 2025). Tags: Autonomous Driving, Image Generation. Status: Unverified.
FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion (May 21, 2025). Tags: Attribute, Diversity. Status: Unverified.
PO-Flow: Flow-based Generative Models for Sampling Potential Outcomes and Counterfactuals (May 21, 2025). Tags: Causal Inference, counterfactual. Status: Unverified.
Sparc3D: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling (May 20, 2025). Tags: 3D Generation, 3D Reconstruction. Status: Unverified.
Latent Flow Transformer (May 20, 2025). Tags: Image Generation. Status: Code Available.
Instructing Text-to-Image Diffusion Models via Classifier-Guided Semantic Optimization (May 20, 2025). Tags: Attribute, Disentanglement. Status: Code Available.
Hunyuan-Game: Industrial-grade Intelligent Game Creation Model (May 20, 2025). Tags: Image Generation, Image to Video Generation. Status: Unverified.
Adaptive Cyclic Diffusion for Inference Scaling (May 20, 2025). Tags: Computational Efficiency, Denoising. Status: Unverified.
UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation (May 20, 2025). Tags: Image Generation, Language Modeling. Status: Unverified.
"Haet Bhasha aur Diskrimineshun": Phonetic Perturbations in Code-Mixed Hinglish to Red-Team LLMs (May 20, 2025). Tags: Image Generation, Red Teaming. Status: Unverified.
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives (May 20, 2025). Tags: Caption Generation, Contrastive Learning. Status: Unverified.
Swin DiT: Diffusion Transformer using Pseudo Shifted Windows (May 19, 2025). Tags: Image Generation. Status: Code Available.
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO (May 19, 2025). Tags: Decoder, Image Generation. Status: Code Available.
FRAbench and GenEval: Scaling Fine-Grained Aspect Evaluation across Tasks, Modalities (May 19, 2025). Tags: Image Generation, Text Generation. Status: Unverified.
SounDiT: Geo-Contextual Soundscape-to-Landscape Generation (May 19, 2025). Tags: Image Generation. Status: Unverified.
A Physics-Inspired Optimizer: Velocity Regularized Adam (May 19, 2025). Tags: image-classification, Image Classification. Status: Unverified.
Diffusion Models with Double Guidance: Generate with aggregated datasets (May 19, 2025). Tags: Image Generation. Status: Unverified.
Improving Compositional Generation with Diffusion Models Using Lift Scores (May 19, 2025). Tags: Image Generation, Position. Status: Code Available.