F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching Oct 9, 2024 Denoising text-to-speech
Code Code Available 11SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers Jun 1, 2025 Denoising
Code Code Available 9LTX-Video: Realtime Video Latent Diffusion Dec 30, 2024 Denoising GPU
Code Code Available 9OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on Mar 4, 2024 Denoising Image Generation
Code Code Available 9Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis May 14, 2025 Denoising Depth Estimation
Code Code Available 7Flow-GRPO: Training Flow Matching Models via Online RL May 8, 2025 Denoising Diversity
Code Code Available 7M&M VTO: Multi-Garment Virtual Try-On and Editing Jun 6, 2024 Denoising Super-Resolution
Code Code Available 7One-Step Image Translation with Text-to-Image Models Mar 18, 2024 Denoising Translation
Code Code Available 7Improving Sample Quality of Diffusion Models Using Self-Attention Guidance Oct 3, 2022 Denoising Diversity
Code Code Available 7StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation Dec 19, 2023 Denoising Image Generation
Code Code Available 6Pseudo Numerical Methods for Diffusion Models on Manifolds Feb 20, 2022 Denoising Image Generation
Code Code Available 6DanceGRPO: Unleashing GRPO on Visual Generation May 12, 2025 Denoising reinforcement-learning
Code Code Available 5FlowTok: Flowing Seamlessly Across Text and Image Tokens Mar 13, 2025 Denoising Image to text
Code Code Available 5OminiControl2: Efficient Conditioning for Diffusion Transformers Mar 11, 2025 Conditional Image Generation Denoising
Code Code Available 5StableAnimator: High-Quality Identity-Preserving Human Image Animation Nov 26, 2024 Denoising Face Reenactment
Code Code Available 5DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving Nov 22, 2024 Autonomous Driving Denoising
Code Code Available 5Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think Oct 9, 2024 Denoising Image Generation
Code Code Available 5Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model Learning Aug 26, 2024 Denoising reinforcement-learning
Code Code Available 5Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems Jul 17, 2024 Autonomous Web Navigation Denoising
Code Code Available 5IMAGDressing-v1: Customizable Virtual Dressing Jul 17, 2024 Denoising Image Generation
Code Code Available 5ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment Mar 8, 2024 Denoising Image Generation
Code Code Available 5Controllable Generation with Text-to-Image Diffusion Models: A Survey Mar 7, 2024 Denoising
Code Code Available 5Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following Nov 28, 2023 Attribute Denoising
Code Code Available 5DreamFusion: Text-to-3D using 2D Diffusion Sep 29, 2022 Denoising Image Generation
Code Code Available 5Energy-Based Transformers are Scalable Learners and Thinkers Jul 2, 2025 Denoising Image Denoising
Code Code Available 4DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Jun 25, 2025 Code Generation Denoising
Code Code Available 4Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Mar 12, 2025 Denoising Language Modeling
Code Code Available 4PharMolixFM: All-Atom Foundation Models for Molecular Modeling and Generation Mar 12, 2025 All Denoising
Code Code Available 4Latent Swap Joint Diffusion for 2D Long-Form Latent Generation Feb 7, 2025 Audio Generation Denoising
Code Code Available 4Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian Noise Dec 5, 2024 Denoising Image Restoration
Code Code Available 4Adversarial Diffusion Compression for Real-World Image Super-Resolution Nov 20, 2024 Decoder Denoising
Code Code Available 4One Step Diffusion via Shortcut Models Oct 16, 2024 Denoising Scheduling
Code Code Available 4Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction Sep 26, 2024 3D Reconstruction Denoising
Code Code Available 4Improving Multi-modal Recommender Systems by Denoising and Aligning Multi-modal Content and User Feedback Jun 18, 2024 Denoising Recommendation Systems
Code Code Available 4Diffusion Models in Low-Level Vision: A Survey Jun 17, 2024 Denoising Survey
Code Code Available 4AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising Jun 11, 2024 Denoising
Code Code Available 4MotionClone: Training-Free Motion Cloning for Controllable Video Generation Jun 8, 2024 Denoising Motion Generation
Code Code Available 4DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images Jun 5, 2024 2D Object Detection Denoising
Code Code Available 4PromptFix: You Prompt and We Fix the Photo May 27, 2024 Denoising Image Generation
Code Code Available 4OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models Mar 16, 2024 Denoising Image Generation
Code Code Available 4Diffusion Model-Based Image Editing: A Survey Feb 27, 2024 Denoising Image Generation
Code Code Available 4Cameras as Rays: Pose Estimation via Ray Diffusion Feb 22, 2024 3D Reconstruction Camera Pose Estimation
Code Code Available 4AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data Feb 1, 2024 Conditional Image Generation Denoising
Code Code Available 4InstructIR: High-Quality Image Restoration Following Human Instructions Jan 29, 2024 Deblurring Denoising
Code Code Available 4DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior Aug 29, 2023 Blind Face Restoration Denoising
Code Code Available 4VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation Mar 15, 2023 Code Generation Denoising
Code Code Available 4FSID: Fully Synthetic Image Denoising via Procedural Scene Generation Dec 7, 2022 Denoising Image Denoising
Code Code Available 4Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model Dec 1, 2022 Colorization compressed sensing
Code Code Available 4DiffusionDet: Diffusion Model for Object Detection Nov 17, 2022 Denoising model
Code Code Available 4Diffusion Models for Medical Image Analysis: A Comprehensive Survey Nov 14, 2022 Denoising Medical Image Analysis
Code Code Available 4