Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation Feb 17, 2025 Image Generation Reinforcement Learning (RL)
Code Code Available 1Diffusion Models without Classifier-free Guidance Feb 17, 2025 Conditional Image Generation Image Generation
Code Code Available 2GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs Feb 17, 2025 Image Generation Language Modeling
— Unverified 0A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond Feb 17, 2025 Contrastive Learning EEG
— Unverified 0Multi-Faceted Multimodal Monosemanticity Feb 16, 2025 Attribute Image Generation
— Unverified 0ManiTrend: Bridging Future Generation and Action Prediction with 3D Flow for Robotic Manipulation Feb 14, 2025 Image Generation Prediction
— Unverified 0SWA-LDM: Toward Stealthy Watermarks for Latent Diffusion Models Feb 14, 2025 Image Generation
Code Code Available 0When the LM misunderstood the human chuckled: Analyzing garden path effects in humans and language models Feb 13, 2025 Image Generation Sentence
— Unverified 0Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models Feb 13, 2025 Image Generation Memorization
Code Code Available 0EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling Feb 13, 2025 Image Generation
— Unverified 0Designing a Conditional Prior Distribution for Flow-Based Generative Models Feb 13, 2025 Image Generation Text to Image Generation
— Unverified 0Detecting Malicious Concepts Without Image Generation in AIGC Feb 13, 2025 Image Generation Text to Image Generation
— Unverified 0ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Feb 13, 2025 Image Generation Image Retrieval
— Unverified 0PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation Feb 12, 2025 Image Generation Text to Image Generation
— Unverified 0Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio Feb 12, 2025 Image Generation
Code Code Available 0Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation Feb 12, 2025 Denoising Image Generation
— Unverified 0Ultrasound Image Generation using Latent Diffusion Models Feb 12, 2025 Image Generation
— Unverified 0HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification Feb 12, 2025 Cell Segmentation Image Generation
Code Code Available 1A Survey on Pre-Trained Diffusion Model Distillations Feb 12, 2025 Image Generation model
— Unverified 0ID-Cloak: Crafting Identity-Specific Cloaks Against Personalized Text-to-Image Generation Feb 12, 2025 Image Generation Text to Image Generation
— Unverified 0BCDDM: Branch-Corrected Denoising Diffusion Model for Black Hole Image Generation Feb 12, 2025 Denoising Image Generation
— Unverified 0SurGrID: Controllable Surgical Simulation via Scene Graph to Image Diffusion Feb 11, 2025 Denoising Image Generation
— Unverified 0Classifier-Free Guidance: From High-Dimensional Analysis to Generalized Guidance Forms Feb 11, 2025 Diversity Image Generation
— Unverified 0A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision Feb 11, 2025 3D geometry Autonomous Driving
— Unverified 0FlexControl: Computation-Aware ControlNet with Differentiable Router for Text-to-Image Generation Feb 11, 2025 Denoising Image Generation
Code Code Available 0Training-Free Safe Denoisers for Safe Use of Diffusion Models Feb 11, 2025 Image Generation Negation
— Unverified 0Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization Feb 11, 2025 Image Generation Motion Generation
— Unverified 0Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models Feb 11, 2025 Image Generation Style Transfer
Code Code Available 1TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation Feb 11, 2025 Image Generation
Code Code Available 2Magic 1-For-1: Generating One Minute Video Clips within One Minute Feb 11, 2025 Image Generation Image to Video Generation
Code Code Available 0Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers Feb 11, 2025 image-classification Image Classification
— Unverified 0CAT: Contrastive Adversarial Training for Evaluating the Robustness of Protective Perturbations in Latent Diffusion Models Feb 11, 2025 Image Generation
Code Code Available 0CausalGeD: Blending Causality and Diffusion for Spatial Gene Expression Generation Feb 11, 2025 Attribute Image Generation
— Unverified 0RusCode: Russian Cultural Code Benchmark for Text-to-Image Generation Feb 11, 2025 Image Generation Text to Image Generation
Code Code Available 0Generative Modeling with Bayesian Sample Inference Feb 11, 2025 Density Estimation Image Generation
Code Code Available 1SketchFlex: Facilitating Spatial-Semantic Coherence in Text-to-Image Generation with Region-Based Sketches Feb 11, 2025 Image Generation Text to Image Generation
Code Code Available 0Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models Feb 10, 2025 Image Generation Response Generation
Code Code Available 1CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers Feb 10, 2025 Image Generation Video Generation
— Unverified 0Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing Feb 10, 2025 Autonomous Driving Image Generation
— Unverified 0From Image to Video: An Empirical Study of Diffusion Representations Feb 10, 2025 Action Recognition Depth Estimation
— Unverified 0Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models Feb 10, 2025 Conditional Image Generation Image Generation
— Unverified 0Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image Generation Feb 9, 2025 Image Generation Personalized Image Generation
Code Code Available 1UniCMs: A Unified Consistency Model For Efficient Multimodal Generation and Understanding Feb 8, 2025 Denoising Image Generation
Code Code Available 1BF-GAN: Development of an AI-driven Bubbly Flow Image Generation Model Using Generative Adversarial Networks Feb 8, 2025 Image Generation
Code Code Available 0Diverse Image Generation with Diffusion Models and Cross Class Label Learning for Polyp Classification Feb 8, 2025 Image Generation Medical Procedure
Code Code Available 0Differentially Private Synthetic Data via APIs 3: Using Simulators Instead of Foundation Model Feb 8, 2025 Image Generation
Code Code Available 2Goku: Flow Based Video Generative Foundation Models Feb 7, 2025 Image Generation Text to Image Generation
Code Code Available 7Cached Multi-Lora Composition for Multi-Concept Image Generation Feb 7, 2025 Computational Efficiency Denoising
Code Code Available 1Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment Feb 7, 2025 Diversity Human-Object Interaction Detection
— Unverified 0QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation Feb 7, 2025 Image Generation Quantization
— Unverified 0