3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models Mar 27, 2025 3D Generation Image Generation
— Unverified 0UGen: Unified Autoregressive Multimodal Model with Progressive Vocabulary Learning Mar 27, 2025 Image Generation
— Unverified 0Model as a Game: On Numerical and Spatial Consistency for Generative Games Mar 27, 2025 Image Generation
— Unverified 0CTRL-O: Language-Controllable Object-Centric Visual Representation Learning Mar 27, 2025 Image Generation Object
— Unverified 0Unified Multimodal Discrete Diffusion Mar 26, 2025 Image Captioning Image Generation
Code Code Available 2High Quality Diffusion Distillation on a Single GPU with Relative and Absolute Position Matching Mar 26, 2025 GPU Image Generation
— Unverified 0BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation Mar 26, 2025 Descriptive Image Generation
— Unverified 0MMGen: Unified Multi-modal Image Generation and Understanding in One Go Mar 26, 2025 Image Generation
— Unverified 0RecTable: Fast Modeling Tabular Data with Rectified Flow Mar 26, 2025 Image Generation Text to Image Generation
Code Code Available 0Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models Mar 26, 2025 Image Generation
— Unverified 0Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability Mar 26, 2025 Age/Unbiased Decision Making
Code Code Available 1Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing Mar 25, 2025 Image Dehazing Image Generation
Code Code Available 2Exploring Disentangled and Controllable Human Image Synthesis: From End-to-End to Stage-by-Stage Mar 25, 2025 Disentanglement Image Generation
— Unverified 0LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration Mar 25, 2025 Image Generation Object
Code Code Available 0VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models Mar 25, 2025 image-classification Image Classification
— Unverified 0SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation Mar 25, 2025 Computational Efficiency Image Generation
Code Code Available 0Scaling Down Text Encoders of Text-to-Image Diffusion Models Mar 25, 2025 GPU Image Generation
Code Code Available 2PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models Mar 25, 2025 Denoising Image Generation
— Unverified 0Reverse Prompt: Cracking the Recipe Inside Text-to-Image Generation Mar 25, 2025 Image Captioning Image Generation
— Unverified 0U-REPA: Aligning Diffusion U-Nets to ViTs Mar 24, 2025 Image Generation
Code Code Available 1PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models Mar 24, 2025 Computational Efficiency Image Generation
Code Code Available 0Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models Mar 24, 2025 4k Image Generation
Code Code Available 3Training-free Diffusion Acceleration with Bottleneck Sampling Mar 24, 2025 Denoising Image Generation
— Unverified 0Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models Mar 24, 2025 Image Generation Super-Resolution
Code Code Available 1Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control Mar 24, 2025 Image Generation Knowledge Distillation
— Unverified 0Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings Mar 24, 2025 Data Augmentation Image Cropping
— Unverified 0Equivariant Image Modeling Mar 24, 2025 Image Generation Zero-shot Generalization
Code Code Available 1An Image-like Diffusion Method for Human-Object Interaction Detection Mar 23, 2025 Human-Object Interaction Detection Image Generation
— Unverified 0DeLoRA: Decoupling Angles and Strength in Low-rank Adaptation Mar 23, 2025 Image Generation Natural Language Understanding
Code Code Available 1TransAnimate: Taming Layer Diffusion to Generate RGBA Video Mar 23, 2025 Image Generation Video Generation
— Unverified 0Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation Mar 23, 2025 Diversity Image Generation
Code Code Available 1Adoption of Watermarking Measures for AI-Generated Content and Implications under the EU AI Act Mar 23, 2025 Image Generation
— Unverified 0TCFG: Tangential Damping Classifier-free Guidance Mar 23, 2025 Image Generation
— Unverified 0Efficient Diffusion Training through Parallelization with Truncated Karhunen-Loève Expansion Mar 22, 2025 Denoising Image Generation
— Unverified 0DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis Mar 22, 2025 Image Augmentation Image Generation
— Unverified 0TDRI: Two-Phase Dialogue Refinement and Co-Adaptation for Interactive Image Generation Mar 22, 2025 Image Generation Text to Image Generation
— Unverified 0OMR-Diffusion:Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Intent Understanding Mar 22, 2025 Image Generation
— Unverified 0FundusGAN: A Hierarchical Feature-Aware Generative Framework for High-Fidelity Fundus Image Generation Mar 22, 2025 Diagnostic Image Generation
— Unverified 0ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation Mar 22, 2025 Image Generation Reinforcement Learning (RL)
— Unverified 0End-to-end Sketch-Guided Path Planning through Imitation Learning for Autonomous Mobile Robots Mar 21, 2025 Image Generation Imitation Learning
Code Code Available 0Halton Scheduler For Masked Generative Image Transformer Mar 21, 2025 Image Generation Text to Image Generation
Code Code Available 3D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens Mar 21, 2025 Conditional Image Generation Image Generation
— Unverified 0Bayesian generative models can flag performance loss, bias, and out-of-distribution image content Mar 21, 2025 Anomaly Detection Data Visualization
— Unverified 0Zero-Shot Styled Text Image Generation, but Make It Autoregressive Mar 21, 2025 Image Generation Text Generation
— Unverified 0Leveraging Text-to-Image Generation for Handling Spurious Correlation Mar 21, 2025 image-classification Image Classification
— Unverified 0EDiT: Efficient Diffusion Transformers with Linear Compressed Attention Mar 20, 2025 Image Generation
— Unverified 0World Knowledge from AI Image Generation for Robot Control Mar 20, 2025 Image Generation World Knowledge
— Unverified 0InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity Mar 20, 2025 Image Generation
Code Code Available 7Tokenize Image as a Set Mar 20, 2025 Image Generation
Code Code Available 2RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models Mar 20, 2025 Image Generation Medical Image Generation
— Unverified 0