Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis Apr 11, 2024 Image Generation
Code Code Available 0Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models -- Technical Challenges and Implications for Monitoring and Verification Apr 11, 2024 Image Generation
— Unverified 0Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models Apr 11, 2024 Image Generation
Code Code Available 1CAT: Contrastive Adapter Training for Personalized Image Generation Apr 11, 2024 Consistent Character Generation Diversity
Code Code Available 0UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion Apr 10, 2024 3D Shape Generation Image Generation
Code Code Available 1Deep Generative Data Assimilation in Multimodal Setting Apr 10, 2024 Image Generation Uncertainty Quantification
Code Code Available 1A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks Apr 10, 2024 Diversity Image Generation
Code Code Available 0GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis Apr 9, 2024 Image Generation Zero-shot Generalization
Code Code Available 2Hyperparameter-Free Medical Image Synthesis for Sharing Data and Improving Site-Specific Segmentation Apr 9, 2024 Image Generation
Code Code Available 0StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion Apr 9, 2024 Image Generation Story Visualization
Code Code Available 1DiffHarmony: Latent Diffusion Model Meets Image Harmonization Apr 9, 2024 Image Compression Image Generation
Code Code Available 1High Noise Scheduling is a Must Apr 9, 2024 Denoising Image Generation
— Unverified 0Tackling Structural Hallucination in Image Translation with Local Diffusion Apr 9, 2024 Hallucination Image Generation
Code Code Available 1Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt Apr 8, 2024 Image Generation Text to Image Generation
— Unverified 0MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Apr 8, 2024 Image Generation Image-to-Image Translation
Code Code Available 3UniFL: Improve Latent Diffusion Model via Unified Feedback Learning Apr 8, 2024 Image Generation Text to Image Generation
— Unverified 0Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI Apr 8, 2024 Image Generation
— Unverified 0Automatic Controllable Colorization via Imagination Apr 8, 2024 Colorization Image Generation
— Unverified 0SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing Apr 8, 2024 Image Generation Object
— Unverified 0MC^2: Multi-concept Guidance for Customized Multi-concept Generation Apr 8, 2024 Image Generation Text to Image Generation
Code Code Available 1StyleForge: Enhancing Text-to-Image Synthesis for Any Artistic Styles with Dual Binding Apr 8, 2024 Image Generation
— Unverified 0Strictly-ID-Preserved and Controllable Accessory Advertising Image Generation Apr 7, 2024 Image Generation
— Unverified 0ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model Apr 7, 2024 Image Generation Marketing
— Unverified 0Contextual Chart Generation for Cyber Deception Apr 7, 2024 Data Interaction Image Generation
— Unverified 0Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models Apr 6, 2024 Image Generation Unconditional Image Generation
Code Code Available 2Pixel-wise RL on Diffusion Models: Reinforcement Learning from Rich Feedback Apr 5, 2024 Denoising Image Generation
— Unverified 0Dynamic Prompt Optimizing for Text-to-Image Generation Apr 5, 2024 Image Generation Text to Image Generation
Code Code Available 2PHISWID: Physics-Inspired Underwater Image Dataset Synthesized from RGB-D Images Apr 5, 2024 Image Enhancement Image Generation
Code Code Available 0Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation Apr 5, 2024 Image Generation
Code Code Available 2Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models Apr 5, 2024 Image Generation Text to Image Generation
— Unverified 0No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance Apr 4, 2024 Benchmarking Image Generation
Code Code Available 2Multi Positive Contrastive Learning with Pose-Consistent Generated Images Apr 4, 2024 Contrastive Learning Image Generation
— Unverified 0CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching Apr 4, 2024 Attribute Image Captioning
Code Code Available 2Diverse and Tailored Image Generation for Zero-shot Multi-label Classification Apr 4, 2024 Image Generation Language Modelling
— Unverified 0GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis Apr 4, 2024 Image Generation Image Reconstruction
— Unverified 0Would Deep Generative Models Amplify Bias in Future Models? Apr 4, 2024 Image Captioning Image Generation
— Unverified 0Reference-Based 3D-Aware Image Editing with Triplanes Apr 4, 2024 3D geometry Disentanglement
— Unverified 0InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation Apr 3, 2024 Image Generation Text to Image Generation
Code Code Available 7Many-to-many Image Generation with Auto-regressive Diffusion Models Apr 3, 2024 Image Generation Novel View Synthesis
— Unverified 0MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment Apr 3, 2024 Image Generation Retrieval
— Unverified 0MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation Apr 3, 2024 Image Generation Prompt Engineering
— Unverified 0On the Scalability of Diffusion-based Text-to-Image Generation Apr 3, 2024 Denoising Diversity
— Unverified 0Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Apr 3, 2024 Image Generation Image Reconstruction
Code Code Available 9Heat Death of Generative Models in Closed-Loop Learning Apr 2, 2024 Image Generation
— Unverified 0Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models Apr 2, 2024 Adversarial Attack Image Generation
— Unverified 0Diffusion^2: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models Apr 2, 2024 3D Generation 4D reconstruction
Code Code Available 2Bi-LORA: A Vision-Language Approach for Synthetic Image Detection Apr 2, 2024 Binary Classification Image Captioning
Code Code Available 1Real, fake and synthetic faces - does the coin have three sides? Apr 2, 2024 Face Swapping Image Generation
— Unverified 0Model-Agnostic Human Preference Inversion in Diffusion Models Apr 1, 2024 Image Generation model
— Unverified 0Towards Label-Efficient Human Matting: A Simple Baseline for Weakly Semi-Supervised Trimap-Free Human Matting Apr 1, 2024 Domain Generalization GPU
Code Code Available 0