| DiTFastAttn: Attention Compression for Diffusion Transformer Models | Jun 12, 2024 | 2kImage Generation | —Unverified | 0 |
| DiTPainter: Efficient Video Inpainting with Diffusion Transformers | Apr 22, 2025 | Video GenerationVideo Inpainting | —Unverified | 0 |
| DIVD: Deblurring with Improved Video Diffusion Model | Dec 1, 2024 | Deblurringmodel | —Unverified | 0 |
| DiVE: DiT-based Video Generation with Enhanced Control | Sep 3, 2024 | Autonomous DrivingVideo Generation | —Unverified | 0 |
| DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer | Apr 28, 2025 | Video Generation | —Unverified | 0 |
| DIY Human Action Data Set Generation | Mar 29, 2018 | Action RecognitionTemporal Action Localization | —Unverified | 0 |
| DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization | Dec 20, 2024 | Computational EfficiencyDiversity | —Unverified | 0 |
| DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships | Oct 14, 2024 | Video Generation | —Unverified | 0 |
| DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers | Jun 12, 2025 | Data AugmentationMarketing | —Unverified | 0 |
| DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds | May 30, 2025 | Image InpaintingVideo Generation | —Unverified | 0 |
| DreamDrive: Generative 4D Scene Modeling from Street View Images | Dec 31, 2024 | Autonomous DrivingNeural Rendering | —Unverified | 0 |
| DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework | Aug 21, 2024 | Video Generation | —Unverified | 0 |
| Dreamitate: Real-World Visuomotor Policy Learning via Video Generation | Jun 24, 2024 | Video Generation | —Unverified | 0 |
| Dreamix: Video Diffusion Models are General Video Editors | Feb 2, 2023 | Image AnimationImage to Video Generation | —Unverified | 0 |
| DreaMoving: A Human Video Generation Framework based on Diffusion Models | Dec 8, 2023 | Video Generation | —Unverified | 0 |
| DreamRelation: Relation-Centric Video Customization | Mar 10, 2025 | RelationTriplet | —Unverified | 0 |
| DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation | Nov 25, 2024 | Large Language ModelMotion Planning | —Unverified | 0 |
| DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control | Oct 17, 2024 | Video Generation | —Unverified | 0 |
| DreamVideo: Composing Your Dream Videos with Customized Subject and Motion | Dec 7, 2023 | Image GenerationVideo Generation | —Unverified | 0 |
| DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance | Dec 5, 2023 | Image to Video GenerationVideo Generation | —Unverified | 0 |
| DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | Oct 17, 2024 | 3DGS4D reconstruction | —Unverified | 0 |
| DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Aug 29, 2024 | Autonomous DrivingDenoising | —Unverified | 0 |
| DriveScape: High-Resolution Driving Video Generation by Multi-View Feature Fusion | Jan 1, 2025 | Autonomous DrivingDenoising | —Unverified | 0 |
| DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation | Sep 9, 2024 | Autonomous DrivingVideo Generation | —Unverified | 0 |
| DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers | Dec 24, 2024 | NavSimTrajectory Planning | —Unverified | 0 |
| Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-to-Video Synthesis | Feb 26, 2021 | Motion GenerationVideo Generation | —Unverified | 0 |
| DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization | May 4, 2025 | DenoisingText-to-Video Generation | —Unverified | 0 |
| Dual-Stream Diffusion Net for Text-to-Video Generation | Aug 16, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| DualX-VSR: Dual Axial SpatialTemporal Transformer for Real-World Video Super-Resolution without Motion Compensation | Jun 5, 2025 | Motion CompensationOptical Flow Estimation | —Unverified | 0 |
| Dynamic Camera Poses and Where to Find Them | Jan 1, 2025 | Point TrackingPose Estimation | —Unverified | 0 |
| Dynamic-I2V: Exploring Image-to-Video Generaion Models via Multimodal LLM | May 26, 2025 | Image to Video GenerationVideo Generation | —Unverified | 0 |
| Dynamic Neural Textures: Generating Talking-Face Videos with Continuously Controllable Expressions | Apr 13, 2022 | Video Generation | —Unverified | 0 |
| DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes | Dec 15, 2024 | DenoisingVideo Generation | —Unverified | 0 |
| DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation | Apr 21, 2025 | AttributeDenoising | —Unverified | 0 |
| E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors | Jul 11, 2024 | Image GenerationVideo Generation | —Unverified | 0 |
| EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation | Aug 23, 2024 | Image GenerationVideo Generation | —Unverified | 0 |
| EasyGenNet: An Efficient Framework for Audio-Driven Gesture Video Generation Based on Diffusion Model | Apr 11, 2025 | Gesture GenerationVideo Generation | —Unverified | 0 |
| Echocardiography video synthesis from end diastolic semantic map via diffusion model | Oct 11, 2023 | DenoisingVideo Generation | —Unverified | 0 |
| EchoFlow: A Foundation Model for Cardiac Ultrasound Image and Video Generation | Mar 28, 2025 | Medical Image AnalysisPrivacy Preserving | —Unverified | 0 |
| EEG to fMRI Synthesis: Is Deep Learning a candidate? | Sep 29, 2020 | Deep LearningEEG | —Unverified | 0 |
| Efficient training for future video generation based on hierarchical disentangled representation of latent variables | Jun 7, 2021 | Future predictionImage Generation | —Unverified | 0 |
| Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition | Mar 21, 2024 | Video Generation | —Unverified | 0 |
| EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation | Nov 13, 2024 | Video Generation | —Unverified | 0 |
| EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation | Jan 1, 2025 | Image GenerationText-to-Video Generation | —Unverified | 0 |
| EMO2: End-Effector Guided Audio-Driven Avatar Video Generation | Jan 18, 2025 | Gesture GenerationVideo Generation | —Unverified | 0 |
| EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | Feb 27, 2024 | Video Generation | —Unverified | 0 |
| Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs | Aug 26, 2023 | In-Context LearningVideo Generation | —Unverified | 0 |
| Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning | Nov 17, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Enabling Versatile Controls for Video Diffusion Models | Mar 21, 2025 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Enabling Visual Composition and Animation in Unsupervised Video Generation | Mar 21, 2024 | Video Generation | —Unverified | 0 |