GlyphDraw: Seamlessly Rendering Text with Intricate Spatial Structures in Text-to-Image Generation Mar 31, 2023 Image Generation Optical Character Recognition (OCR)
Code Code Available 25 GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction May 30, 2023 Image Generation Instruction Following
Code Code Available 25 HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation Oct 18, 2024 Disentanglement Image Generation
Code Code Available 25 Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising May 29, 2023 Denoising Image Generation
Code Code Available 25 Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis Dec 3, 2024 Image Generation
Code Code Available 25 GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching Mar 17, 2025 Autonomous Driving Image Generation
Code Code Available 25 Generative Enhancement for 3D Medical Images Mar 19, 2024 counterfactual Image Generation
Code Code Available 25 Generative Adversarial Transformers Mar 1, 2021 Disentanglement Image Generation
Code Code Available 25 Generative Image as Action Models Jul 10, 2024 Image Generation Robot Manipulation
Code Code Available 25 Geodesic Diffusion Models for Medical Image-to-Image Generation Mar 2, 2025 Denoising Image Denoising
Code Code Available 25 GAUDI: A Neural Architect for Immersive 3D Scene Generation Jul 27, 2022 Image Generation Scene Generation
Code Code Available 25 Gaussian Mixture Flow Matching Models Apr 7, 2025 Denoising Image Generation
Code Code Available 25 Attention Mechanisms in Computer Vision: A Survey Nov 15, 2021 image-classification Image Classification
Code Code Available 25 GAN Prior Embedded Network for Blind Face Restoration in the Wild May 13, 2021 Blind Face Restoration Decoder
Code Code Available 25 GANSpace: Discovering Interpretable GAN Controls Apr 6, 2020 Image Generation
Code Code Available 25 Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition Feb 23, 2024 Image Generation Personalized Image Generation
Code Code Available 25 Blended Latent Diffusion Jun 6, 2022 Image Generation Image Inpainting
Code Code Available 25 Generating Images with Multimodal Language Models May 26, 2023 Decoder Image Generation
Code Code Available 25 Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions Apr 27, 2025 Image Generation Motion Synthesis
Code Code Available 25 Generative Diffusion Models on Graphs: Methods and Applications Feb 6, 2023 Denoising Graph Generation
Code Code Available 25 Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models May 24, 2024 Image Generation Machine Unlearning
Code Code Available 25 Generative Modeling by Estimating Gradients of the Data Distribution Jul 12, 2019 Image Generation Image Inpainting
Code Code Available 25 GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis Jan 30, 2023 Image Generation Scene Understanding
Code Code Available 25 Attention Calibration for Disentangled Text-to-Image Personalization Mar 27, 2024 Image Generation Novel Concepts
Code Code Available 25 From Text to Pose to Image: Improving Diffusion Model Control and Quality Nov 19, 2024 Image Generation Prompt Engineering
Code Code Available 25 GAN Compression: Efficient Architectures for Interactive Conditional GANs Mar 19, 2020 Image Generation Neural Architecture Search
Code Code Available 25 GenAI Arena: An Open Evaluation Platform for Generative Models Jun 6, 2024 Image Generation Instruction Following
Code Code Available 25 Geometry-Complete Diffusion for 3D Molecule Generation and Optimization Feb 8, 2023 3D Molecule Generation Denoising
Code Code Available 25 BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities Oct 18, 2024 Conditional Image Generation Image Generation
Code Code Available 25 Denoising Diffusion Models for Plug-and-Play Image Restoration May 15, 2023 Deblurring Denoising
Code Code Available 25 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis Mar 19, 2024 Image Generation Text to Image Generation
Code Code Available 25 GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning May 22, 2025 Attribute Image Generation
Code Code Available 25 Flux Already Knows -- Activating Subject-Driven Image Generation without Training Apr 12, 2025 Image Generation Virtual Try-on
Code Code Available 25 Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation May 31, 2024 3D Generation Image Generation
Code Code Available 25 Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos Jul 23, 2024 Image Generation Point Tracking
Code Code Available 25 Boosting Latent Diffusion with Flow Matching Dec 12, 2023 Decoder Diversity
Code Code Available 25 Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation Mar 25, 2024 Denoising Image Generation
Code Code Available 25 FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner Sep 26, 2024 Image Generation Text to Image Generation
Code Code Available 25 Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks May 5, 2021 image-classification Image Classification
Code Code Available 25 BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion Jul 20, 2023 Conditional Text-to-Image Synthesis Denoising
Code Code Available 25 Deep PCB To COCO Convertor May 1, 2022 Classification Data Augmentation
Code Code Available 25 Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Oct 17, 2024 Image Generation Text to Image Generation
Code Code Available 25 FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition May 22, 2024 Image Generation
Code Code Available 25 Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality Mar 1, 2025 Image Enhancement Image Generation
Code Code Available 25 Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes Mar 5, 2023 3D Human Pose Estimation Human Detection
Code Code Available 25 Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis Jun 15, 2023 Image Generation Preference Mapping
Code Code Available 25 FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching Dec 19, 2024 Image Generation Prediction
Code Code Available 25 Flow-Anchored Consistency Models Jul 4, 2025 Image Generation
Code Code Available 25 Flow-Guided Diffusion for Video Inpainting Nov 26, 2023 Denoising Image Generation
Code Code Available 25 Flow Matching in Latent Space Jul 17, 2023 Computational Efficiency Image Generation
Code Code Available 25