| Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Feb 22, 2024 | Video Generation | —Unverified | 0 | 0 |
| NewMove: Customizing text-to-video models with novel motions | Dec 7, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | Jan 18, 2024 | ObjectText-to-Video Generation | —Unverified | 0 | 0 |
| CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers | Feb 10, 2025 | Image GenerationVideo Generation | —Unverified | 0 | 0 |
| CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention | Sep 3, 2024 | Human AnimationVideo Generation | —Unverified | 0 | 0 |
| Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | May 15, 2024 | Image to Video GenerationOptical Flow Estimation | —Unverified | 0 | 0 |
| DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models | May 11, 2025 | parameter-efficient fine-tuningVideo Alignment | —Unverified | 0 | 0 |
| Decouple Content and Motion for Conditional Image-to-Video Generation | Nov 24, 2023 | Image to Video GenerationVideo Generation | —Unverified | 0 | 0 |
| DeepHS-HDRVideo: Deep High Speed High Dynamic Range Video Reconstruction | Oct 10, 2022 | Optical Flow EstimationVideo Frame Interpolation | —Unverified | 0 | 0 |
| DeepRhythm: Exposing DeepFakes with Attentional Visual Heartbeat Rhythms | Jun 13, 2020 | DeepFake DetectionFace Swapping | —Unverified | 0 | 0 |
| DeepVerse: 4D Autoregressive Video Generation as a World Model | Jun 1, 2025 | Video Generation | —Unverified | 0 | 0 |
| Deep Video Generation, Prediction and Completion of Human Action Sequences | Nov 23, 2017 | Human action generationPrediction | —Unverified | 0 | 0 |
| Denoising Diffusion Probabilistic Models in Six Simple Steps | Feb 6, 2024 | DenoisingVideo Generation | —Unverified | 0 | 0 |
| Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation | Sep 19, 2024 | DenoisingVideo Generation | —Unverified | 0 | 0 |
| Designing Parameter and Compute Efficient Diffusion Transformers using Distillation | Feb 20, 2025 | Knowledge DistillationNVIDIA Jetson Orin Nano | —Unverified | 0 | 0 |
| DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing | Jun 26, 2025 | Video EditingVideo Generation | —Unverified | 0 | 0 |
| Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling | Dec 30, 2024 | Retrieval-augmented GenerationStory Visualization | —Unverified | 0 | 0 |
| DialogueNeRF: Towards Realistic Avatar Face-to-Face Conversation Video Generation | Mar 15, 2022 | NeRFTalking Head Generation | —Unverified | 0 | 0 |
| DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models | Dec 5, 2024 | Temporal SequencesVideo Generation | —Unverified | 0 | 0 |
| DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-based Human Video Generation | Jan 1, 2024 | Video Generation | —Unverified | 0 | 0 |
| DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures | Sep 11, 2024 | DiversityTalking Head Generation | —Unverified | 0 | 0 |
| Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation | Jan 6, 2023 | Face GenerationTalking Face Generation | —Unverified | 0 | 0 |
| Diffusion Adversarial Post-Training for One-Step Video Generation | Jan 14, 2025 | Video Generation | —Unverified | 0 | 0 |
| Diffusion-based Realistic Listening Head Generation via Hybrid Motion Modeling | Jan 1, 2025 | Motion GenerationVideo Generation | —Unverified | 0 | 0 |
| Diffusion Models for Robotic Manipulation: A Survey | Apr 11, 2025 | Data AugmentationImage Augmentation | —Unverified | 0 | 0 |
| Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data | Jul 23, 2024 | Video Generation | —Unverified | 0 | 0 |
| DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | Nov 7, 2024 | 3D GenerationDenoising | —Unverified | 0 | 0 |
| Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion | Feb 5, 2024 | ObjectVideo Generation | —Unverified | 0 | 0 |
| DirectorLLM for Human-Centric Video Generation | Dec 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control | May 21, 2024 | AttributeMotion Generation | —Unverified | 0 | 0 |
| Disentangled Recurrent Wasserstein Autoencoder | Jan 19, 2021 | DisentanglementRepresentation Learning | —Unverified | 0 | 0 |
| Disentangling Foreground and Background Motion for Enhanced Realism in Human Video Generation | May 26, 2024 | Video Generation | —Unverified | 0 | 0 |
| DiTFastAttn: Attention Compression for Diffusion Transformer Models | Jun 12, 2024 | 2kImage Generation | —Unverified | 0 | 0 |
| DiTPainter: Efficient Video Inpainting with Diffusion Transformers | Apr 22, 2025 | Video GenerationVideo Inpainting | —Unverified | 0 | 0 |
| DIVD: Deblurring with Improved Video Diffusion Model | Dec 1, 2024 | Deblurringmodel | —Unverified | 0 | 0 |
| DiVE: DiT-based Video Generation with Enhanced Control | Sep 3, 2024 | Autonomous DrivingVideo Generation | —Unverified | 0 | 0 |
| DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer | Apr 28, 2025 | Video Generation | —Unverified | 0 | 0 |
| DIY Human Action Data Set Generation | Mar 29, 2018 | Action RecognitionTemporal Action Localization | —Unverified | 0 | 0 |
| DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization | Dec 20, 2024 | Computational EfficiencyDiversity | —Unverified | 0 | 0 |
| DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships | Oct 14, 2024 | Video Generation | —Unverified | 0 | 0 |
| DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers | Jun 12, 2025 | Data AugmentationMarketing | —Unverified | 0 | 0 |
| DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds | May 30, 2025 | Image InpaintingVideo Generation | —Unverified | 0 | 0 |
| DreamDrive: Generative 4D Scene Modeling from Street View Images | Dec 31, 2024 | Autonomous DrivingNeural Rendering | —Unverified | 0 | 0 |
| DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework | Aug 21, 2024 | Video Generation | —Unverified | 0 | 0 |
| Dreamitate: Real-World Visuomotor Policy Learning via Video Generation | Jun 24, 2024 | Video Generation | —Unverified | 0 | 0 |
| Dreamix: Video Diffusion Models are General Video Editors | Feb 2, 2023 | Image AnimationImage to Video Generation | —Unverified | 0 | 0 |
| DreaMoving: A Human Video Generation Framework based on Diffusion Models | Dec 8, 2023 | Video Generation | —Unverified | 0 | 0 |
| DreamRelation: Relation-Centric Video Customization | Mar 10, 2025 | RelationTriplet | —Unverified | 0 | 0 |
| DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation | Nov 25, 2024 | Large Language ModelMotion Planning | —Unverified | 0 | 0 |
| DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control | Oct 17, 2024 | Video Generation | —Unverified | 0 | 0 |