What's Next? Exploring Utilization, Challenges, and Future Directions of AI-Generated Image Tools in Graphic Design Jun 19, 2024 Image Generation
— Unverified 0DF40: Toward Next-Generation Deepfake Detection Jun 19, 2024 DeepFake Detection Face Reenactment
Code Code Available 3Improving Visual Commonsense in Language Models via Multiple Image Generation Jun 19, 2024 Common Sense Reasoning Image Generation
Code Code Available 1Training Diffusion Models with Federated Learning Jun 18, 2024 Denoising Federated Learning
— Unverified 0Cyclic 2.5D Perceptual Loss for Cross-Modal 3D Medical Image Synthesis: T1w MRI to Tau PET Jun 18, 2024 Image Generation SSIM
Code Code Available 0AITTI: Learning Adaptive Inclusive Token for Text-to-Image Generation Jun 18, 2024 Attribute Fairness
Code Code Available 1ARTIST: Improving the Generation of Text-rich Images with Disentangled Diffusion Models and Large Language Models Jun 17, 2024 Disentanglement Image Generation
— Unverified 0Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models Jun 17, 2024 All Contrastive Learning
Code Code Available 1Decomposed evaluations of geographic disparities in text-to-image models Jun 17, 2024 Attribute Diversity
— Unverified 0GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation Jun 17, 2024 Image Generation Math
Code Code Available 0PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models Jun 17, 2024 Image Generation
— Unverified 0Discriminative Hamiltonian Variational Autoencoder for Accurate Tumor Segmentation in Data-Scarce Regimes Jun 17, 2024 Data Augmentation Image Generation
— Unverified 0Generative Visual Instruction Tuning Jun 17, 2024 Image Generation Image-text matching
Code Code Available 0Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% Jun 17, 2024 image-classification Image Classification
Code Code Available 2Autoregressive Image Generation without Vector Quantization Jun 17, 2024 Image Generation Quantization
Code Code Available 5Latent Denoising Diffusion GAN: Faster sampling, Higher image quality Jun 17, 2024 Denoising Diversity
Code Code Available 1Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models Jun 17, 2024 Decoder Image Generation
— Unverified 0Mixture-of-Subspaces in Low-Rank Adaptation Jun 16, 2024 Common Sense Reasoning Image Generation
Code Code Available 0STAR: Scale-wise Text-to-image generation via Auto-Regressive representations Jun 16, 2024 Diversity Image Generation
Code Code Available 2An Analysis on Quantizing Diffusion Transformers Jun 16, 2024 Conditional Image Generation Denoising
— Unverified 0Can Generative AI Replace Immunofluorescent Staining Processes? A Comparison Study of Synthetically Generated CellPainting Images from Brightfield Jun 15, 2024 Image Generation
— Unverified 0Poetry2Image: An Iterative Correction Framework for Images Generated from Chinese Classical Poetry Jun 15, 2024 Image Generation Text to Image Generation
— Unverified 0MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation Jun 15, 2024 AudioCaps Image Generation
Code Code Available 0Make It Count: Text-to-Image Generation with an Accurate Number of Objects Jun 14, 2024 Denoising Image Generation
Code Code Available 2Crafting Parts for Expressive Object Composition Jun 14, 2024 Denoising Image Generation
— Unverified 0ControlVAR: Exploring Controllable Visual Autoregressive Modeling Jun 14, 2024 Image Generation
Code Code Available 2Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation Jun 13, 2024 GPU Image Generation
— Unverified 0Understanding Hallucinations in Diffusion Models through Mode Interpolation Jun 13, 2024 Hallucination Image Generation
Code Code Available 2StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning Jun 13, 2024 Diversity Image Generation
— Unverified 0An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Jun 13, 2024 Image Generation Inductive Bias
— Unverified 0Batch-Instructed Gradient for Prompt Evolution:Systematic Prompt Optimization for Enhanced Text-to-Image Synthesis Jun 13, 2024 Image Generation Text to Image Generation
Code Code Available 0Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization Jun 13, 2024 Image Generation
Code Code Available 1EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts Jun 13, 2024 Conditional Image Generation Image Generation
Code Code Available 5TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation Jun 12, 2024 Benchmarking Image Generation
Code Code Available 1WMAdapter: Adding WaterMark Control to Latent Diffusion Models Jun 12, 2024 Image Generation Transfer Learning
— Unverified 0Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation Jun 12, 2024 Image Generation Perceptual Distance
— Unverified 0FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation Jun 12, 2024 Image Generation Text to Image Generation
— Unverified 0DiTFastAttn: Attention Compression for Diffusion Transformer Models Jun 12, 2024 2k Image Generation
— Unverified 0VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks Jun 12, 2024 Image Generation Language Modeling
Code Code Available 5What If We Recaption Billions of Web Images with LLaMA-3? Jun 12, 2024 Cross-Modal Retrieval Image Generation
— Unverified 0Understanding and Mitigating Compositional Issues in Text-to-Image Generative Models Jun 12, 2024 Image Generation
Code Code Available 0CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models Jun 12, 2024 Image Generation text-guided-generation
Code Code Available 1Diffusion Soup: Model Merging for Text-to-Image Diffusion Models Jun 12, 2024 Continual Learning Image Generation
— Unverified 0Progress Towards Decoding Visual Imagery via fNIRS Jun 11, 2024 Image Generation Image Reconstruction
— Unverified 0Image and Video Tokenization with Binary Spherical Quantization Jun 11, 2024 Decoder Image Generation
Code Code Available 3An Image is Worth 32 Tokens for Reconstruction and Generation Jun 11, 2024 Image Generation Image Reconstruction
Code Code Available 3Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Jun 11, 2024 Hallucination Image Description
Code Code Available 2SPIN: Spacecraft Imagery for Navigation Jun 11, 2024 Data Augmentation Image Generation
Code Code Available 1Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration Jun 11, 2024 Decoder Image Generation
— Unverified 0Understanding Visual Concepts Across Models Jun 11, 2024 Image Generation object-detection
Code Code Available 0