AutoPresent: Designing Structured Visuals from Scratch Jan 1, 2025 Image Generation
Code Code Available 2Regression Guided Strategy to Automated Facial Beauty Optimization through Image Synthesis Jan 1, 2025 Image Generation regression
— Unverified 0Dual Diffusion for Unified Image Generation and Understanding Dec 31, 2024 Image Generation Language Modeling
Code Code Available 2Token Pruning for Caching Better: 9 Times Acceleration on Stable Diffusion for Free Dec 31, 2024 Denoising Image Generation
Code Code Available 0MLLM-as-a-Judge for Image Safety without Human Labeling Dec 31, 2024 Image Generation
— Unverified 0PQD: Post-training Quantization for Efficient Diffusion Models Dec 30, 2024 Diversity Image Generation
— Unverified 0Text-to-Image GAN with Pretrained Representations Dec 30, 2024 Domain Generalization Image Generation
— Unverified 0Quantum Diffusion Model for Quark and Gluon Jet Generation Dec 30, 2024 Denoising Image Generation
Code Code Available 0Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis Dec 30, 2024 counterfactual Image Generation
— Unverified 0VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control Dec 30, 2024 Denoising Image Generation
Code Code Available 2Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation Dec 30, 2024 3D Generation Image Generation
— Unverified 0Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition Dec 29, 2024 Data Augmentation Finger Vein Recognition
Code Code Available 0Open-Sora: Democratizing Efficient Video Production for All Dec 29, 2024 All Image Generation
Code Code Available 13Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond) Dec 29, 2024 Deblurring Image Generation
Code Code Available 1Diff4MMLiTS: Advanced Multimodal Liver Tumor Segmentation via Diffusion-Based Image Synthesis and Alignment Dec 29, 2024 Image Generation Segmentation
— Unverified 0FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation Dec 29, 2024 Fairness Image Generation
Code Code Available 1INFELM: In-depth Fairness Evaluation of Large Text-To-Image Models Dec 28, 2024 Fairness Image Generation
— Unverified 0Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games Dec 28, 2024 Image Generation LEMMA
— Unverified 0An Ordinary Differential Equation Sampler with Stochastic Start for Diffusion Bridge Models Dec 28, 2024 Conditional Image Generation Image Generation
— Unverified 0Focusing Image Generation to Mitigate Spurious Correlations Dec 27, 2024 Attribute Data Augmentation
— Unverified 0Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales Dec 27, 2024 image-classification Image Classification
— Unverified 0P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision Dec 27, 2024 Image Generation
— Unverified 0Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation Dec 25, 2024 Denoising Image Generation
— Unverified 0Protective Perturbations against Unauthorized Data Usage in Diffusion-based Image Generation Dec 25, 2024 Image Generation
— Unverified 0DiFiC: Your Diffusion Model Holds the Secret to Fine-Grained Clustering Dec 25, 2024 Clustering Data Augmentation
— Unverified 0Elucidating Flow Matching ODE Dynamics with Respect to Data Geometries Dec 25, 2024 Image Generation Memorization
— Unverified 0UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation Dec 25, 2024 Image Generation Text to Image Generation
— Unverified 0DRDM: A Disentangled Representations Diffusion Model for Synthesizing Realistic Person Images Dec 25, 2024 Image Generation Pose Transfer
— Unverified 01.58-bit FLUX Dec 24, 2024 Computational Efficiency Image Generation
— Unverified 0Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models Dec 24, 2024 Image Generation
— Unverified 0Ensuring Consistency for In-Image Translation Dec 24, 2024 Image Generation Large Language Model
— Unverified 0EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation Dec 24, 2024 Image Captioning Image Generation
Code Code Available 2Extract Free Dense Misalignment from CLIP Dec 24, 2024 Hallucination Image Generation
Code Code Available 1Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction Dec 24, 2024 Face Generation Image Generation
— Unverified 0RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction Dec 24, 2024 Image Generation multimodal generation
— Unverified 0Personalized Large Vision-Language Models Dec 23, 2024 Image Generation
— Unverified 0Discriminative Image Generation with Diffusion Models for Zero-Shot Learning Dec 23, 2024 Image Generation Zero-Shot Learning
— Unverified 0Diffusion-Based Approaches in Medical Image Generation and Analysis Dec 22, 2024 Image Generation Medical Image Analysis
— Unverified 0RealisID: Scale-Robust and Fine-Controllable Identity Customization via Local and Global Complementation Dec 22, 2024 Image Generation
— Unverified 0Modular Conversational Agents for Surveys and Interviews Dec 22, 2024 AI Agent Image Generation
— Unverified 0HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories Dec 22, 2024 3D Shape Reconstruction Image Generation
— Unverified 0Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers Dec 22, 2024 Denoising Image Generation
— Unverified 0Human-Guided Image Generation for Expanding Small-Scale Training Image Datasets Dec 22, 2024 Image Generation object-detection
Code Code Available 0DreamOmni: Unified Image Generation and Editing Dec 22, 2024 Image Generation
— Unverified 0Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation Dec 22, 2024 Image Generation Text to Image Generation
Code Code Available 0Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching Dec 22, 2024 Image Generation Text to Image Generation
Code Code Available 2Adversarial Attack Against Images Classification based on Generative Adversarial Networks Dec 21, 2024 Adversarial Attack Decision Making
— Unverified 0When Worse is Better: Navigating the compression-generation tradeoff in visual tokenization Dec 20, 2024 Image Generation
— Unverified 0Stylish and Functional: Guided Interpolation Subject to Physical Constraints Dec 20, 2024 Image Generation
— Unverified 0CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up Dec 20, 2024 8k GPU
Code Code Available 3