MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation May 25, 2025 Image Generation Image Reconstruction
Code Code Available 1OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks May 24, 2025 Image Generation Instruction Following
Code Code Available 1Taming Diffusion for Dataset Distillation with High Representativeness May 23, 2025 Dataset Distillation Image Generation
Code Code Available 1RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning May 23, 2025 Image Generation Language Modeling
Code Code Available 1Co-Reinforcement Learning for Unified Multimodal Understanding and Generation May 23, 2025 Image Generation reinforcement-learning
Code Code Available 1Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On May 22, 2025 Image Generation Virtual Try-on
Code Code Available 1Forward-only Diffusion Probabilistic Models May 22, 2025 Conditional Image Generation Image Dehazing
Code Code Available 1Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation May 21, 2025 Image Generation
Code Code Available 1Training-Free Watermarking for Autoregressive Image Generation May 20, 2025 Image Generation
Code Code Available 1Accelerate TarFlow Sampling with GS-Jacobi Iteration May 19, 2025 Image Generation
Code Code Available 1VTBench: Evaluating Visual Tokenizers for Autoregressive Image Generation May 19, 2025 Image Generation Image Reconstruction
Code Code Available 1Is Artificial Intelligence Generated Image Detection a Solved Problem? May 18, 2025 Data Augmentation Image Generation
Code Code Available 1FastCar: Cache Attentive Replay for Fast Auto-Regressive Video Generation on the Edge May 17, 2025 Image Generation Scheduling
Code Code Available 1Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models May 16, 2025 Image Generation
Code Code Available 1One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework May 16, 2025 Attribute Image Generation
Code Code Available 1Mini Diffuser: Fast Multi-task Diffusion Policy Training Using Two-level Mini-batches May 14, 2025 Action Generation Image Generation
Code Code Available 1Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition May 9, 2025 Image Generation
Code Code Available 1A Preliminary Study for GPT-4o on Image Restoration May 8, 2025 Image Dehazing Image Generation
Code Code Available 1Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation May 6, 2025 Image Generation Mamba
Code Code Available 1FreeGraftor: Training-Free Cross-Image Feature Grafting for Subject-Driven Text-to-Image Generation Apr 22, 2025 Image Generation Text to Image Generation
Code Code Available 1U-Shape Mamba: State Space Model for faster diffusion Apr 18, 2025 Decoder Image Generation
Code Code Available 1SupResDiffGAN a new approach for the Super-Resolution task Apr 18, 2025 Image Generation Super-Resolution
Code Code Available 1SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding Apr 17, 2025 Image Generation Large Language Model
Code Code Available 1Personalized Text-to-Image Generation with Auto-Regressive Models Apr 17, 2025 Image Generation Personalized Image Generation
Code Code Available 1DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging Apr 16, 2025 Image Generation model
Code Code Available 1Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception Apr 15, 2025 Data Augmentation Denoising
Code Code Available 1Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing Apr 14, 2025 Image Generation Text to Image Generation
Code Code Available 1GeoUni: A Unified Model for Generating Geometry Diagrams, Problems and Problem Solutions Apr 14, 2025 Image Generation
Code Code Available 1Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes Apr 14, 2025 Image Generation Large Language Model
Code Code Available 1LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs Apr 11, 2025 Benchmarking Image Generation
Code Code Available 1Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging Apr 11, 2025 Attribute Computational Efficiency
Code Code Available 1ID-Booth: Identity-consistent Face Generation with Diffusion Models Apr 10, 2025 Denoising Diversity
Code Code Available 1A Unified Agentic Framework for Evaluating Conditional Image Generation Apr 9, 2025 Conditional Image Generation Image Generation
Code Code Available 1DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation Apr 9, 2025 Image Generation Text to Image Generation
Code Code Available 1An Empirical Study of GPT-4o Image Generation Capabilities Apr 8, 2025 Benchmarking Image Generation
Code Code Available 1Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking Apr 8, 2025 Image Generation
Code Code Available 1MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Apr 1, 2025 Image Generation Image Reconstruction
Code Code Available 1Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability Mar 26, 2025 Age/Unbiased Decision Making
Code Code Available 1Equivariant Image Modeling Mar 24, 2025 Image Generation Zero-shot Generalization
Code Code Available 1U-REPA: Aligning Diffusion U-Nets to ViTs Mar 24, 2025 Image Generation
Code Code Available 1Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models Mar 24, 2025 Image Generation Super-Resolution
Code Code Available 1DeLoRA: Decoupling Angles and Strength in Low-rank Adaptation Mar 23, 2025 Image Generation Natural Language Understanding
Code Code Available 1Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation Mar 23, 2025 Diversity Image Generation
Code Code Available 1Multi-focal Conditioned Latent Diffusion for Person Image Synthesis Mar 19, 2025 Image Generation
Code Code Available 1Efficient Personalization of Quantized Diffusion Model without Backpropagation Mar 19, 2025 Image Generation
Code Code Available 1SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model Mar 18, 2025 Autonomous Driving Image Generation
Code Code Available 1DPImageBench: A Unified Benchmark for Differentially Private Image Synthesis Mar 18, 2025 Image Generation
Code Code Available 1Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation Mar 16, 2025 Image Generation Specificity
Code Code Available 1Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models Mar 14, 2025 Image Generation Novel View Synthesis
Code Code Available 1AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption Mar 13, 2025 Image Generation
Code Code Available 1