F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching Oct 9, 2024 Denoising text-to-speech
Code Code Available 115 SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers Jun 1, 2025 Denoising
Code Code Available 95 OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on Mar 4, 2024 Denoising Image Generation
Code Code Available 95 LTX-Video: Realtime Video Latent Diffusion Dec 30, 2024 Denoising GPU
Code Code Available 95 Flow-GRPO: Training Flow Matching Models via Online RL May 8, 2025 Denoising Diversity
Code Code Available 75 M&M VTO: Multi-Garment Virtual Try-On and Editing Jun 6, 2024 Denoising Super-Resolution
Code Code Available 75 Improving Sample Quality of Diffusion Models Using Self-Attention Guidance Oct 3, 2022 Denoising Diversity
Code Code Available 75 Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis May 14, 2025 Denoising Depth Estimation
Code Code Available 75 One-Step Image Translation with Text-to-Image Models Mar 18, 2024 Denoising Translation
Code Code Available 75 Pseudo Numerical Methods for Diffusion Models on Manifolds Feb 20, 2022 Denoising Image Generation
Code Code Available 65 StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation Dec 19, 2023 Denoising Image Generation
Code Code Available 65 Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think Oct 9, 2024 Denoising Image Generation
Code Code Available 55 Controllable Generation with Text-to-Image Diffusion Models: A Survey Mar 7, 2024 Denoising
Code Code Available 55 FlowTok: Flowing Seamlessly Across Text and Image Tokens Mar 13, 2025 Denoising Image to text
Code Code Available 55 Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Nov 28, 2023 Attribute Denoising
Code Code Available 55 ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment Mar 8, 2024 Denoising Image Generation
Code Code Available 55 DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving Nov 22, 2024 Autonomous Driving Denoising
Code Code Available 55 DreamFusion: Text-to-3D using 2D Diffusion Sep 29, 2022 Denoising Image Generation
Code Code Available 55 Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems Jul 17, 2024 Autonomous Web Navigation Denoising
Code Code Available 55 Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model Learning Aug 26, 2024 Denoising reinforcement-learning
Code Code Available 55 StableAnimator: High-Quality Identity-Preserving Human Image Animation Nov 26, 2024 Denoising Face Reenactment
Code Code Available 55 DanceGRPO: Unleashing GRPO on Visual Generation May 12, 2025 Denoising reinforcement-learning
Code Code Available 55 OminiControl2: Efficient Conditioning for Diffusion Transformers Mar 11, 2025 Conditional Image Generation Denoising
Code Code Available 55 IMAGDressing-v1: Customizable Virtual Dressing Jul 17, 2024 Denoising Image Generation
Code Code Available 55 RePaint: Inpainting using Denoising Diffusion Probabilistic Models Jan 24, 2022 Denoising Image Inpainting
Code Code Available 45 Cameras as Rays: Pose Estimation via Ray Diffusion Feb 22, 2024 3D Reconstruction Camera Pose Estimation
Code Code Available 45 Energy-Based Transformers are Scalable Learners and Thinkers Jul 2, 2025 Denoising Image Denoising
Code Code Available 45 Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Mar 12, 2025 Denoising Language Modeling
Code Code Available 45 FSID: Fully Synthetic Image Denoising via Procedural Scene Generation Dec 7, 2022 Denoising Image Denoising
Code Code Available 45 Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian Noise Dec 5, 2024 Denoising Image Restoration
Code Code Available 45 PharMolixFM: All-Atom Foundation Models for Molecular Modeling and Generation Mar 12, 2025 All Denoising
Code Code Available 45 PromptFix: You Prompt and We Fix the Photo May 27, 2024 Denoising Image Generation
Code Code Available 45 Simple Baselines for Image Restoration Apr 10, 2022 Deblurring Denoising
Code Code Available 45 OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models Mar 16, 2024 Denoising Image Generation
Code Code Available 45 Diffusion Models in Low-Level Vision: A Survey Jun 17, 2024 Denoising Survey
Code Code Available 45 MotionClone: Training-Free Motion Cloning for Controllable Video Generation Jun 8, 2024 Denoising Motion Generation
Code Code Available 45 AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data Feb 1, 2024 Conditional Image Generation Denoising
Code Code Available 45 DiffusionDet: Diffusion Model for Object Detection Nov 17, 2022 Denoising model
Code Code Available 45 DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Jun 25, 2025 Code Generation Denoising
Code Code Available 45 One Step Diffusion via Shortcut Models Oct 16, 2024 Denoising Scheduling
Code Code Available 45 VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation Mar 15, 2023 Code Generation Denoising
Code Code Available 45 Diffusion Model-Based Image Editing: A Survey Feb 27, 2024 Denoising Image Generation
Code Code Available 45 Adversarial Diffusion Compression for Real-World Image Super-Resolution Nov 20, 2024 Decoder Denoising
Code Code Available 45 DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images Jun 5, 2024 2D Object Detection Denoising
Code Code Available 45 Diffusion Models for Medical Image Analysis: A Comprehensive Survey Nov 14, 2022 Denoising Medical Image Analysis
Code Code Available 45 AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising Jun 11, 2024 Denoising
Code Code Available 45 Latent Swap Joint Diffusion for 2D Long-Form Latent Generation Feb 7, 2025 Audio Generation Denoising
Code Code Available 45 High-Resolution Image Synthesis with Latent Diffusion Models Dec 20, 2021 Denoising GPU
Code Code Available 45 DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior Aug 29, 2023 Blind Face Restoration Denoising
Code Code Available 45 InstructIR: High-Quality Image Restoration Following Human Instructions Jan 29, 2024 Deblurring Denoising
Code Code Available 45