| Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation | Feb 11, 2025 | Gesture Generation, Video Generation | —Unverified | 0 |
| Contextual RNN-GANs for Abstract Reasoning Diagram Generation | Sep 29, 2016 | Generative Adversarial Network, Video Generation | —Unverified | 0 |
| Continuously Controllable Facial Expression Editing in Talking Face Videos | Sep 17, 2022 | Image-to-Image Translation, Video Generation | —Unverified | 0 |
| Continuous-Time Video Generation via Learning Motion Dynamics with Neural ODE | Dec 21, 2021 | Unconditional Video Generation, Video Generation | —Unverified | 0 |
| Contrastive Video Textures | Jan 1, 2021 | Contrastive Learning, Video Generation | —Unverified | 0 |
| Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation | Aug 9, 2018 | Facial expression generation, Image-to-Image Translation | —Unverified | 0 |
| Controllable Longer Image Animation with Diffusion Models | May 27, 2024 | Image Animation, motion prediction | —Unverified | 0 |
| Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE | Mar 9, 2023 | Video Generation | —Unverified | 0 |
| Controllable Video Generation through Global and Local Motion Dynamics | Apr 13, 2022 | Video Generation | —Unverified | 0 |
| Controllable Video Generation With Sparse Trajectories | Jun 1, 2018 | Video Generation, Video Prediction | —Unverified | 0 |
| Convergence of Diffusion Models Under the Manifold Hypothesis in High-Dimensions | Sep 27, 2024 | Denoising, Gaussian Processes | —Unverified | 0 |
| Copy Motion From One to Another: Fake Motion Video Generation | May 3, 2022 | Video Generation | —Unverified | 0 |
| Co-Speech Gesture Video Generation with Implicit Motion-Audio Entanglement | Jan 1, 2025 | Gesture Generation, Motion Generation | —Unverified | 0 |
| CPA: Camera-pose-awareness Diffusion Transformer for Video Generation | Dec 2, 2024 | Text-to-Video Generation, Video Generation | —Unverified | 0 |
| Cross-Modal Learning for Music-to-Music-Video Description Generation | Mar 14, 2025 | Video Description, Video Generation | —Unverified | 0 |
| Cross-View Exocentric to Egocentric Video Synthesis | Jul 7, 2021 | Generative Adversarial Network, Video Generation | —Unverified | 0 |
| Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model | Apr 15, 2024 | GPU, Image Generation | —Unverified | 0 |
| Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes | May 30, 2025 | counterfactual, Video Generation | —Unverified | 0 |
| Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Feb 22, 2024 | Video Generation | —Unverified | 0 |
| NewMove: Customizing text-to-video models with novel motions | Dec 7, 2023 | Text-to-Video Generation, Video Generation | —Unverified | 0 |
| CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | Jan 18, 2024 | Object, Text-to-Video Generation | —Unverified | 0 |
| CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers | Feb 10, 2025 | Image Generation, Video Generation | —Unverified | 0 |
| CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention | Sep 3, 2024 | Human Animation, Video Generation | —Unverified | 0 |
| Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | May 15, 2024 | Image to Video Generation, Optical Flow Estimation | —Unverified | 0 |
| DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models | May 11, 2025 | parameter-efficient fine-tuning, Video Alignment | —Unverified | 0 |
| Decouple Content and Motion for Conditional Image-to-Video Generation | Nov 24, 2023 | Image to Video Generation, Video Generation | —Unverified | 0 |
| DeepHS-HDRVideo: Deep High Speed High Dynamic Range Video Reconstruction | Oct 10, 2022 | Optical Flow Estimation, Video Frame Interpolation | —Unverified | 0 |
| DeepRhythm: Exposing DeepFakes with Attentional Visual Heartbeat Rhythms | Jun 13, 2020 | DeepFake Detection, Face Swapping | —Unverified | 0 |
| DeepVerse: 4D Autoregressive Video Generation as a World Model | Jun 1, 2025 | Video Generation | —Unverified | 0 |
| Deep Video Generation, Prediction and Completion of Human Action Sequences | Nov 23, 2017 | Human action generation, Prediction | —Unverified | 0 |
| Denoising Diffusion Probabilistic Models in Six Simple Steps | Feb 6, 2024 | Denoising, Video Generation | —Unverified | 0 |
| Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation | Sep 19, 2024 | Denoising, Video Generation | —Unverified | 0 |
| Designing Parameter and Compute Efficient Diffusion Transformers using Distillation | Feb 20, 2025 | Knowledge Distillation, NVIDIA Jetson Orin Nano | —Unverified | 0 |
| DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing | Jun 26, 2025 | Video Editing, Video Generation | —Unverified | 0 |
| Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling | Dec 30, 2024 | Retrieval-augmented Generation, Story Visualization | —Unverified | 0 |
| DialogueNeRF: Towards Realistic Avatar Face-to-Face Conversation Video Generation | Mar 15, 2022 | NeRF, Talking Head Generation | —Unverified | 0 |
| DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models | Dec 5, 2024 | Temporal Sequences, Video Generation | —Unverified | 0 |
| DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-based Human Video Generation | Jan 1, 2024 | Video Generation | —Unverified | 0 |
| DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures | Sep 11, 2024 | Diversity, Talking Head Generation | —Unverified | 0 |
| Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation | Jan 6, 2023 | Face Generation, Talking Face Generation | —Unverified | 0 |
| Diffusion Adversarial Post-Training for One-Step Video Generation | Jan 14, 2025 | Video Generation | —Unverified | 0 |
| Diffusion-based Realistic Listening Head Generation via Hybrid Motion Modeling | Jan 1, 2025 | Motion Generation, Video Generation | —Unverified | 0 |
| Diffusion Models for Robotic Manipulation: A Survey | Apr 11, 2025 | Data Augmentation, Image Augmentation | —Unverified | 0 |
| Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data | Jul 23, 2024 | Video Generation | —Unverified | 0 |
| DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | Nov 7, 2024 | 3D Generation, Denoising | —Unverified | 0 |
| Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion | Feb 5, 2024 | Object, Video Generation | —Unverified | 0 |
| DirectorLLM for Human-Centric Video Generation | Dec 19, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control | May 21, 2024 | Attribute, Motion Generation | —Unverified | 0 |
| Disentangled Recurrent Wasserstein Autoencoder | Jan 19, 2021 | Disentanglement, Representation Learning | —Unverified | 0 |
| Disentangling Foreground and Background Motion for Enhanced Realism in Human Video Generation | May 26, 2024 | Video Generation | —Unverified | 0 |