DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation Mar 13, 2025 Image Generation Text to Image Generation
— Unverified 0RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models Mar 13, 2025 Image Generation In-Context Learning
— Unverified 0AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption Mar 13, 2025 Image Generation
Code Code Available 1MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment Mar 13, 2025 Image Generation
Code Code Available 0ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content Mar 13, 2025 Benchmarking Image Generation
— Unverified 0Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation Mar 13, 2025 Image Generation
— Unverified 0Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation Mar 12, 2025 Attribute Denoising
— Unverified 0Neighboring Autoregressive Modeling for Efficient Visual Generation Mar 12, 2025 Image Generation Text to Image Generation
Code Code Available 2Zero-Shot Subject-Centric Generation for Creative Application Using Entropy Fusion Mar 12, 2025 Descriptive Image Generation
— Unverified 0Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models Mar 12, 2025 Attribute Diversity
— Unverified 0NAMI: Efficient Image Generation via Progressive Rectified Flow Transformers Mar 12, 2025 Image Generation
— Unverified 0UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer Mar 12, 2025 Image Generation
— Unverified 0Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets Mar 12, 2025 Active Learning Conditional Image Generation
— Unverified 0Decoupled Doubly Contrastive Learning for Cross Domain Facial Action Unit Detection Mar 12, 2025 Action Unit Detection Contrastive Learning
— Unverified 0PromptMap: An Alternative Interaction Style for AI-Based Image Generation Mar 12, 2025 Image Generation Semantic Similarity
Code Code Available 0Revealing Unintentional Information Leakage in Low-Dimensional Facial Portrait Representations Mar 12, 2025 Image Generation
Code Code Available 0DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction Mar 12, 2025 Image Generation
Code Code Available 0FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model Mar 12, 2025 Image Generation
— Unverified 0Aligning Text to Image in Diffusion Models is Easier Than You Think Mar 11, 2025 Contrastive Learning Image Generation
Code Code Available 1OminiControl2: Efficient Conditioning for Diffusion Transformers Mar 11, 2025 Conditional Image Generation Denoising
Code Code Available 5Robust Latent Matters: Boosting Image Generation with Sampling Error Mar 11, 2025 Benchmarking Image Generation
Code Code Available 3A Deep Bayesian Nonparametric Framework for Robust Mutual Information Estimation Mar 11, 2025 Image Generation Mutual Information Estimation
— Unverified 0Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens Mar 11, 2025 Decoder Image Generation
Code Code Available 0GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing Mar 11, 2025 3D Reconstruction Depth Estimation
— Unverified 0Generating Robot Constitutions & Benchmarks for Semantic Safety Mar 11, 2025 Collision Avoidance Image Generation
— Unverified 0LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization Mar 11, 2025 GPU Image Generation
Code Code Available 2Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models? Mar 10, 2025 Contrastive Learning Image Generation
— Unverified 0Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Mar 10, 2025 Image Description Image Generation
Code Code Available 2Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Mar 10, 2025 Diversity Image Generation
Code Code Available 0AI for Just Work: Constructing Diverse Imaginations of AI beyond "Replacing Humans" Mar 10, 2025 Image Generation
— Unverified 0Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping Mar 10, 2025 Denoising Image Generation
— Unverified 0LatexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending Mar 10, 2025 Computational Efficiency Denoising
— Unverified 0NFIG: Autoregressive Image Generation with Next-Frequency Prediction Mar 10, 2025 Image Generation Prediction
— Unverified 0Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting Mar 10, 2025 Image Generation
— Unverified 0EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer Mar 10, 2025 Computational Efficiency Image Generation
— Unverified 0Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help Mar 10, 2025 Image Generation Text to Image Generation
— Unverified 0Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation Alignment Mar 10, 2025 Domain Adaptation Image Generation
Code Code Available 1Synthetic Lung X-ray Generation through Cross-Attention and Affinity Transformation Mar 10, 2025 Image Generation Medical Image Analysis
— Unverified 0NukesFormers: Unpaired Hyperspectral Image Generation with Non-Uniform Domain Alignment Mar 10, 2025 Contrastive Learning Image Generation
— Unverified 0WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation Mar 10, 2025 Common Sense Reasoning Image Generation
Code Code Available 4TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models Mar 10, 2025 Contrastive Learning Denoising
Code Code Available 1V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation Mar 10, 2025 Decoder Image Generation
Code Code Available 1NeAS: 3D Reconstruction from X-ray Images using Neural Attenuation Surface Mar 10, 2025 3D Reconstruction Image Generation
— Unverified 0Effective and Efficient Masked Image Generation Models Mar 10, 2025 Image Generation
Code Code Available 1TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation Mar 10, 2025 Denoising Image Generation
— Unverified 0ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy Mar 9, 2025 Decoder Image Generation
— Unverified 0Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation Mar 9, 2025 Attribute Image Generation
Code Code Available 0Generative modelling with jump-diffusions Mar 9, 2025 Image Generation
Code Code Available 0DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability Mar 9, 2025 Contrastive Learning Facial Editing
— Unverified 0Adding Additional Control to One-Step Diffusion with Joint Distribution Matching Mar 9, 2025 Image Generation
— Unverified 0