MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost Dec 2, 2024 Image Generation
Code Code Available 35 Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework Oct 28, 2024 Image Generation Image Manipulation
Code Code Available 35 AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Apr 1, 2025 Image Generation
Code Code Available 35 Deep Generative Models on 3D Representations: A Survey Oct 27, 2022 3D-Aware Image Synthesis 3D Shape Generation
Code Code Available 35 An Image is Worth 32 Tokens for Reconstruction and Generation Jun 11, 2024 Image Generation Image Reconstruction
Code Code Available 35 Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation Mar 3, 2025 3D Generation 3D Reconstruction
Code Code Available 35 Deciphering Oracle Bone Language with Diffusion Models Jun 2, 2024 Decipherment Image Generation
Code Code Available 35 AI2Agent: An End-to-End Framework for Deploying AI Projects as Autonomous Agents Mar 31, 2025 Image Generation Text to Image Generation
Code Code Available 35 CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation Oct 12, 2024 Conditional Image Generation GPU
Code Code Available 35 Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models Feb 7, 2024 counterfactual Image Generation
Code Code Available 35 InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation Sep 12, 2023 GPU Image Generation
Code Code Available 35 Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models Jan 1, 2024 Image Generation Text to Image Generation
Code Code Available 35 Improved Denoising Diffusion Probabilistic Models Feb 18, 2021 Denoising Image Generation
Code Code Available 35 Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer May 7, 2024 Image Generation Super-Resolution
Code Code Available 35 ImageInWords: Unlocking Hyper-Detailed Image Descriptions May 5, 2024 Image Generation Specificity
Code Code Available 35 ImageFolder: Autoregressive Image Generation with Folded Tokens Oct 2, 2024 Image Generation Image Reconstruction
Code Code Available 35 ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation Apr 12, 2023 Image Generation Preference Mapping
Code Code Available 35 Attention Distillation: A Unified Approach to Visual Characteristics Transfer Feb 27, 2025 Denoising Image Generation
Code Code Available 35 ControlAR: Controllable Image Generation with Autoregressive Models Oct 3, 2024 Image Generation
Code Code Available 35 Image and Video Tokenization with Binary Spherical Quantization Jun 11, 2024 Decoder Image Generation
Code Code Available 35 All are Worth Words: A ViT Backbone for Diffusion Models Sep 25, 2022 All Conditional Image Generation
Code Code Available 35 Consistency Flow Matching: Defining Straight Flows with Velocity Consistency Jul 2, 2024 Image Generation
Code Code Available 35 Highly Compressed Tokenizer Can Generate Without Training Jun 9, 2025 Image Generation Quantization
Code Code Available 35 DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models Feb 8, 2022 Diagnostic Image Captioning
Code Code Available 35 Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Sep 11, 2024 3D Generation 3D Reconstruction
Code Code Available 35 Consistency Models Made Easy Jun 20, 2024 Computational Efficiency GPU
Code Code Available 35 Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models Nov 20, 2023 Image Generation
Code Code Available 35 DDT: Decoupled Diffusion Transformer Apr 8, 2025 Denoising Image Generation
Code Code Available 35 Hierarchical Text-Conditional Image Generation with CLIP Latents Apr 13, 2022 Conditional Image Generation Decoder
Code Code Available 35 MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation Feb 16, 2023 Image Generation Text to Image Generation
Code Code Available 35 REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers Apr 15, 2025 Image Generation
Code Code Available 35 Collaborative Neural Rendering using Anime Character Sheets Jul 12, 2022 Image Generation Image to 3D
Code Code Available 25 Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient Nov 26, 2024 GPU Image Generation
Code Code Available 25 GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation Oct 27, 2024 Image Generation Text to Image Generation
Code Code Available 25 Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints Nov 26, 2024 Denoising Image Generation
Code Code Available 25 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching Apr 4, 2024 Attribute Image Captioning
Code Code Available 25 GRPose: Learning Graph Relations for Human Image Generation with Pose Priors Aug 29, 2024 Image Generation Pose Estimation
Code Code Available 25 GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction May 30, 2023 Image Generation Instruction Following
Code Code Available 25 GPT4Point: A Unified Framework for Point-Language Understanding and Generation Dec 5, 2023 3D Generation Image Generation
Code Code Available 25 GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning May 22, 2025 Attribute Image Generation
Code Code Available 25 GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal-Conditioned Policy Aug 26, 2024 Few-Shot Learning Image Generation
Code Code Available 25 Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models Sep 17, 2024 Brain Computer Interface EEG
Code Code Available 25 CogView: Mastering Text-to-Image Generation via Transformers May 26, 2021 Image Generation Super-Resolution
Code Code Available 25 GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models Dec 20, 2021 Diversity Image Generation
Code Code Available 25 BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis Mar 25, 2022 Image Generation Speech Synthesis
Code Code Available 25 BCI: Breast Cancer Immunohistochemical Image Generation through Pyramid Pix2pix Apr 25, 2022 Breast Cancer Detection Breast Cancer Histology Image Classification
Code Code Available 25 CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers Apr 28, 2022 Image Generation Language Modeling
Code Code Available 25 GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis Apr 9, 2024 Image Generation Zero-shot Generalization
Code Code Available 25 Bayesian Flow Networks Aug 14, 2023 Bayesian Inference Data Compression
Code Code Available 25 Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive Jan 16, 2024 Domain Generalization Image Generation
Code Code Available 25