SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models May 23, 2023 Image Generation
Code Code Available 0Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning May 23, 2023 Image Generation Optical Flow Estimation
Code Code Available 2Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach May 23, 2023 GPU Image Generation
Code Code Available 2Generalizable Synthetic Image Detection via Language-guided Contrastive Learning May 23, 2023 Contrastive Learning Image Generation
Code Code Available 1LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models May 23, 2023 Common Sense Reasoning Image Generation
Code Code Available 2Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models May 23, 2023 Attribute Image Generation
Code Code Available 1Parts of Speech-Grounded Subspaces in Vision-Language Models May 23, 2023 Image Generation POS
Code Code Available 1VisorGPT: Learning Visual Prior via Generative Pre-Training May 23, 2023 Image Generation Language Modeling
Code Code Available 1DIVA: A Dirichlet Process Mixtures Based Incremental Deep Clustering Algorithm via Variational Auto-Encoder May 23, 2023 Clustering Deep Clustering
Code Code Available 1Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation May 23, 2023 All Image Generation
Code Code Available 1DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection May 23, 2023 Image Generation
Code Code Available 1Variational Bayesian Framework for Advanced Image Generation with Domain-Related Variables May 23, 2023 Image Generation Image-to-Image Translation
— Unverified 0Design a Delicious Lunchbox in Style May 22, 2023 Generative Adversarial Network Image Generation
Code Code Available 0If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection May 22, 2023 Image Generation Text to Image Generation
Code Code Available 1ControlVideo: Training-free Controllable Text-to-Video Generation May 22, 2023 Image Generation Text-to-Video Generation
Code Code Available 2AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation May 22, 2023 audio-visual learning Image Generation
Code Code Available 1The CLIP Model is Secretly an Image-to-Prompt Converter May 22, 2023 Image Generation Image-Variation
— Unverified 0Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration May 22, 2023 Data Augmentation Image Generation
Code Code Available 1Quantifying the effect of X-ray scattering for data generation in real-time defect detection May 22, 2023 Defect Detection Image Generation
Code Code Available 0iWarpGAN: Disentangling Identity and Style to Generate Synthetic Iris Images May 21, 2023 Image Generation
— Unverified 0DiffUCD:Unsupervised Hyperspectral Image Change Detection with Semantic Correlation Diffusion Model May 21, 2023 Change Detection Contrastive Learning
— Unverified 0InstructVid2Vid: Controllable Video Editing with Natural Language Instructions May 21, 2023 Attribute Image Generation
— Unverified 0Dual-Diffusion: Dual Conditional Denoising Diffusion Probabilistic Models for Blind Super-Resolution Reconstruction in RSIs May 20, 2023 Blind Super-Resolution Denoising
Code Code Available 1DiffCap: Exploring Continuous Diffusion on Image Captioning May 20, 2023 Caption Generation Diversity
— Unverified 0Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization May 19, 2023 Image Generation Position
Code Code Available 1ReDirTrans: Latent-to-Latent Translation for Gaze and Head Redirection May 19, 2023 Attribute Gaze Estimation
— Unverified 0LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis May 19, 2023 Conditional Image Generation Conditional Text-to-Image Synthesis
Code Code Available 1Generative Sliced MMD Flows with Riesz Kernels May 19, 2023 Image Generation
Code Code Available 0Few-shot 3D Shape Generation May 19, 2023 3D Shape Generation Diversity
— Unverified 0LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model May 19, 2023 Image Generation Image Inpainting
Code Code Available 1Efficient Cross-Lingual Transfer for Chinese Stable Diffusion with Images as Pivots May 19, 2023 Cross-Lingual Transfer Image Generation
— Unverified 0Constructing Dreams using Generative AI May 19, 2023 Image Generation Prompt Engineering
— Unverified 0LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation May 19, 2023 Image Generation Instruction Following
Code Code Available 1PTQD: Accurate Post-Training Quantization for Diffusion Models May 18, 2023 Denoising Image Generation
Code Code Available 1SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models May 18, 2023 Image Generation Object
— Unverified 0OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding May 18, 2023 3D Classification 3D Shape Representation
Code Code Available 2UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild May 18, 2023 Image Generation
Code Code Available 2LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation May 18, 2023 Attribute Image Generation
Code Code Available 1RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture May 18, 2023 Image Generation Indoor Scene Synthesis
— Unverified 0Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation May 18, 2023 Image Generation Text Generation
— Unverified 0Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces May 18, 2023 Image Generation
Code Code Available 1Personalization as a Shortcut for Few-Shot Backdoor Attack against Text-to-Image Diffusion Models May 18, 2023 Backdoor Attack Image Generation
— Unverified 0Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model May 18, 2023 Image Generation Language Modeling
Code Code Available 2AIwriting: Relations Between Image Generation and Digital Writing May 18, 2023 Image Generation Text Generation
— Unverified 0X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models May 18, 2023 Benchmarking Image Generation
Code Code Available 1Private Gradient Estimation is Useful for Generative Modeling May 18, 2023 Image Generation Privacy Preserving
— Unverified 0Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners May 18, 2023 Image Generation Image-text matching
Code Code Available 1Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation May 18, 2023 Image Generation Text to Image Generation
Code Code Available 1What You See is What You Read? Improving Text-Image Alignment Evaluation May 17, 2023 Image Generation Image to text
Code Code Available 1Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models May 17, 2023 Image Generation Text-to-Video Generation
— Unverified 0