Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction May 21, 2024 Image Generation Prediction
— Unverified 0An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation May 21, 2024 Image Generation Language Modeling
Code Code Available 0CustomText: Customized Textual Image Generation using Diffusion Models May 21, 2024 Decoder Image Generation
— Unverified 0Personalized Residuals for Concept-Driven Text-to-Image Generation May 21, 2024 GPU Image Generation
— Unverified 0Diffusion for World Modeling: Visual Details Matter in Atari May 20, 2024 Image Generation reinforcement-learning
Code Code Available 5Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices May 20, 2024 Image Generation Video Editing
Code Code Available 2TriLoRA: Integrating SVD for Advanced Style Personalization in Text-to-Image Generation May 18, 2024 Image Generation Text to Image Generation
— Unverified 0UPAM: Unified Prompt Attack in Text-to-Image Generation Models Against Both Textual Filters and Visual Checkers May 18, 2024 Image Generation Text to Image Generation
— Unverified 0On the Trajectory Regularity of ODE-based Diffusion Sampling May 18, 2024 Denoising Image Generation
Code Code Available 3Lean Attention: Hardware-Aware Scalable Attention Mechanism for the Decode-Phase of Transformers May 17, 2024 Image Generation Text Generation
— Unverified 0Improving face generation quality and prompt following with synthetic captions May 17, 2024 Face Generation Image Generation
— Unverified 0UniRAG: Universal Retrieval Augmentation for Large Vision Language Models May 16, 2024 Image Captioning Image Generation
Code Code Available 1Chameleon: Mixed-Modal Early-Fusion Foundation Models May 16, 2024 Image Captioning Image Generation
Code Code Available 7MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis May 16, 2024 Image Generation
— Unverified 0VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing May 16, 2024 Human-Object Interaction Detection Image Generation
— Unverified 0KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment May 16, 2024 Depth Estimation Image Generation
— Unverified 0VisioBlend: Sketch and Stroke-Guided Denoising Diffusion Probabilistic Model for Realistic Image Generation May 15, 2024 Denoising Efficient Diffusion Personalization
— Unverified 0DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual Explanations May 15, 2024 counterfactual Image Generation
Code Code Available 0Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images May 15, 2024 Image Generation MS-SSIM
— Unverified 0Similarity and Quality Metrics for MR Image-To-Image Translation May 14, 2024 Image Generation Image-to-Image Translation
— Unverified 0Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding May 14, 2024 Image Generation Language Modeling
Code Code Available 7Compositional Text-to-Image Generation with Dense Blob Representations May 14, 2024 Image Generation In-Context Learning
— Unverified 0RATLIP: Generative Adversarial CLIP Text-to-Image Synthesis Based on Recurrent Affine Transformations May 13, 2024 Image Generation
Code Code Available 0SAR Image Synthesis with Diffusion Models May 13, 2024 Denoising Image Generation
— Unverified 0Quality Assessment for AI Generated Images with Instruction Tuning May 12, 2024 Image Generation Image Quality Assessment
Code Code Available 1Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation May 11, 2024 Attribute Image Generation
— Unverified 0Deep MMD Gradient Flow without adversarial training May 10, 2024 Denoising Image Generation
— Unverified 0Controllable Image Generation With Composed Parallel Token Prediction May 10, 2024 Image Generation Prediction
— Unverified 0MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation May 9, 2024 Image Generation Text to Image Generation
Code Code Available 2An Inversion-based Measure of Memorization for Diffusion Models May 9, 2024 Image Generation Memorization
Code Code Available 0Exploring Text-Guided Single Image Editing for Remote Sensing Images May 9, 2024 Image Generation
Code Code Available 0Frame Interpolation with Consecutive Brownian Bridge Diffusion May 9, 2024 Conditional Image Generation Image Generation
Code Code Available 1A Survey on Personalized Content Synthesis with Diffusion Models May 9, 2024 Face Generation Image Generation
— Unverified 0VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis May 9, 2024 Decoder Diversity
— Unverified 0Diffusion-HMC: Parameter Inference with Diffusion-model-driven Hamiltonian Monte Carlo May 8, 2024 Image Generation
Code Code Available 0Variational Schrödinger Diffusion Models May 8, 2024 Image Generation Variational Inference
— Unverified 0FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation May 8, 2024 Image Generation Text to Image Generation
— Unverified 0Discrepancy-based Diffusion Models for Lesion Detection in Brain MRI May 8, 2024 Anomaly Detection Image Generation
— Unverified 0HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis May 8, 2024 Generative Adversarial Network Image Generation
— Unverified 0TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model May 7, 2024 Image Generation
— Unverified 0Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation May 7, 2024 Image Generation
Code Code Available 0Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer May 7, 2024 Image Generation Super-Resolution
Code Code Available 3Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model May 7, 2024 Image Generation
— Unverified 0ResNCT: A Deep Learning Model for the Synthesis of Nephrographic Phase Images in CT Urography May 7, 2024 Image Generation SSIM
— Unverified 0CCDM: Continuous Conditional Diffusion Models for Image Generation May 6, 2024 Denoising Image Generation
Code Code Available 1Generated Contents Enrichment May 6, 2024 Image Generation
— Unverified 0ImageInWords: Unlocking Hyper-Detailed Image Descriptions May 5, 2024 Image Generation Specificity
Code Code Available 3Data-Efficient Molecular Generation with Hierarchical Textual Inversion May 5, 2024 Drug Discovery Image Generation
Code Code Available 0U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers May 4, 2024 Image Generation Inductive Bias
Code Code Available 3Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI May 3, 2024 Image Generation
Code Code Available 0