Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin | May 30, 2025 | Denoising, Image Generation | Code Available 1
Graph Flow Matching: Enhancing Image Generation with Neighbor-Aware Flow Fields | May 30, 2025 | Image Generation | Unverified 0
Category-aware EEG image generation based on wavelet transform and contrast semantic loss | May 30, 2025 | EEG, Image Generation | Code Available 0
Multi-Group Proportional Representation for Text-to-Image Models | May 29, 2025 | Image Generation | Unverified 0
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL | May 29, 2025 | Arithmetic Reasoning, Image Generation | Unverified 0
VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration | May 29, 2025 | Image Generation, Semantic Segmentation | Code Available 0
Inference-time Scaling of Diffusion Models through Classical Search | May 29, 2025 | Image Generation, Navigate | Unverified 0
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model | May 29, 2025 | Decoder, Image Generation | Code Available 2
R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation | May 29, 2025 | Benchmarking, Image Generation | Unverified 0
LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers | May 29, 2025 | Denoising, Image Generation | Unverified 0
Diffusion Sampling Path Tells More: An Efficient Plug-and-Play Strategy for Sample Filtering | May 29, 2025 | Denoising, Image Generation | Code Available 0
RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries | May 29, 2025 | Image Generation | Unverified 0
Image Aesthetic Reasoning: A New Benchmark for Medical Image Screening with MLLMs | May 29, 2025 | Image Generation, Multiple-choice | Unverified 0
Implicit Inversion turns CLIP into a Decoder | May 29, 2025 | Decoder, Image Generation | Code Available 0
Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis | May 29, 2025 | Dimensionality Reduction, Image Generation | Unverified 0
How Animals Dance (When You're Not Looking) | May 29, 2025 | Image Generation | Unverified 0
Cross-modal RAG: Sub-dimensional Retrieval-Augmented Text-to-Image Generation | May 28, 2025 | Image Generation, Language Modeling | Code Available 0
HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer | May 28, 2025 | Image Generation, Mixture-of-Experts | Code Available 7
Principled Out-of-Distribution Generalization via Simplicity | May 28, 2025 | Image Generation, Out-of-Distribution Generalization | Unverified 0
Rhetorical Text-to-Image Generation via Two-layer Diffusion Policy Optimization | May 28, 2025 | Denoising, Image Generation | Unverified 0
Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction | May 27, 2025 | 3D Generation, Image Generation | Unverified 0
DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction | May 27, 2025 | Image Generation | Code Available 2
Unveiling Impact of Frequency Components on Membership Inference Attacks for Diffusion Models | May 27, 2025 | Image Generation | Unverified 0
Creativity in LLM-based Multi-Agent Systems: A Survey | May 27, 2025 | Image Generation, Language Modeling | Unverified 0
Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning | May 26, 2025 | Classification, image-classification | Unverified 0
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities | May 26, 2025 | Image Generation | Unverified 0
ImgEdit: A Unified Image Editing Dataset and Benchmark | May 26, 2025 | Image Editing | Code Available 4
Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots | May 26, 2025 | Image Generation, Text to Image Generation | Code Available 1
DiSA: Diffusion Step Annealing in Autoregressive Image Generation | May 26, 2025 | Denoising, Image Generation | Code Available 2
MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models | May 26, 2025 | Image Generation, Visual Question Answering (VQA) | Unverified 0
StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation | May 26, 2025 | Image Generation, Instruction Following | Unverified 0
Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion | May 26, 2025 | Denoising, Image Generation | Code Available 1
Plug-and-Play Context Feature Reuse for Efficient Masked Generation | May 25, 2025 | Image Generation | Unverified 0
STRICT: Stress Test of Rendering Images Containing Text | May 25, 2025 | Image Generation, Instruction Following | Code Available 1
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving | May 25, 2025 | Autonomous Driving, Image Generation | Unverified 0
Towards Understanding the Mechanisms of Classifier-Free Guidance | May 25, 2025 | Image Generation | Unverified 0
MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation | May 25, 2025 | Image Generation, Image Reconstruction | Code Available 1
TextDiffuser-RL: Efficient and Robust Text Layout Optimization for High-Fidelity Text-to-Image Synthesis | May 25, 2025 | CPU, GPU | Unverified 0
Training-free Stylized Text-to-Image Generation with Fast Inference | May 25, 2025 | Image Generation, Text to Image Generation | Unverified 0
RAISE: Realness Assessment for Image Synthesis and Evaluation | May 25, 2025 | Image Generation | Code Available 0
Test-Time Scaling of Diffusion Models via Noise Trajectory Search | May 24, 2025 | Denoising, Image Generation | Code Available 0
Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking | May 24, 2025 | Image Generation, Language Modelling | Unverified 0
How to build a consistency model: Learning flow maps via self-distillation | May 24, 2025 | Image Generation | Unverified 0
TNG-CLIP: Training-Time Negation Data Generation for Negation Awareness of CLIP | May 24, 2025 | Image Captioning, Image Generation | Unverified 0
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks | May 24, 2025 | Image Generation, Instruction Following | Code Available 1
Mod-Adapter: Tuning-Free and Versatile Multi-concept Personalization via Modulation Adapter | May 24, 2025 | Image Generation, Mixture-of-Experts | Unverified 0
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation | May 24, 2025 | Image Generation, Text to Image Generation | Code Available 0
MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation | May 23, 2025 | Audio Generation, Benchmarking | Unverified 0
RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning | May 23, 2025 | Image Generation, Language Modeling | Code Available 1
FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving | May 23, 2025 | Autonomous Driving, Image Generation | Unverified 0