| Controllable Video Generation With Sparse Trajectories | Jun 1, 2018 | Video Generation, Video Prediction | —Unverified | 0 |
| Audio-driven Gesture Generation via Deviation Feature in the Latent Space | Mar 27, 2025 | Gesture Generation, Video Generation | —Unverified | 0 |
| FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation | Jul 9, 2025 | Descriptive, Text Generation | —Unverified | 0 |
| FFA Sora, video generation as fundus fluorescein angiography simulator | Dec 23, 2024 | Privacy Preserving, Question Answering | —Unverified | 0 |
| Controllable Video Generation through Global and Local Motion Dynamics | Apr 13, 2022 | Video Generation | —Unverified | 0 |
| Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE | Mar 9, 2023 | Video Generation | —Unverified | 0 |
| Controllable Longer Image Animation with Diffusion Models | May 27, 2024 | Image Animation, motion prediction | —Unverified | 0 |
| FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing | Mar 10, 2024 | Image Generation, Text-to-Video Editing | —Unverified | 0 |
| Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions | Jul 27, 2024 | Computational Efficiency, Video Generation | —Unverified | 0 |
| Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation | Aug 9, 2018 | Facial expression generation, Image-to-Image Translation | —Unverified | 0 |
| Audio-Driven Co-Speech Gesture Video Generation | Dec 5, 2022 | Video Generation | —Unverified | 0 |
| FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality | Oct 25, 2024 | Video Generation | —Unverified | 0 |
| Fast Autoregressive Video Generation with Diagonal Decoding | Mar 18, 2025 | Video Generation | —Unverified | 0 |
| Contrastive Video Textures | Jan 1, 2021 | Contrastive Learning, Video Generation | —Unverified | 0 |
| AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers | Mar 25, 2025 | Video Generation | —Unverified | 0 |
| Fashion-VDM: Video Diffusion Model for Virtual Try-On | Oct 31, 2024 | Video Generation, Virtual Try-on | —Unverified | 0 |
| Continuous-Time Video Generation via Learning Motion Dynamics with Neural ODE | Dec 21, 2021 | Unconditional Video Generation, Video Generation | —Unverified | 0 |
| Continuously Controllable Facial Expression Editing in Talking Face Videos | Sep 17, 2022 | Image-to-Image Translation, Video Generation | —Unverified | 0 |
| Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Dec 21, 2023 | Synthetic Data Generation, Video Generation | —Unverified | 0 |
| ACDC: Autoregressive Coherent Multimodal Generation using Diffusion Correction | Oct 7, 2024 | multimodal generation, Story Generation | —Unverified | 0 |
| Face Consistency Benchmark for GenAI Video | May 16, 2025 | Video Generation | —Unverified | 0 |
| Facial Expression Video Generation Based-On Spatio-temporal Convolutional GAN: FEV-GAN | Oct 20, 2022 | Facial expression generation, Video Generation | —Unverified | 0 |
| Face Video Generation from a Single Image and Landmarks | Apr 25, 2019 | Image-to-Image Translation, Translation | —Unverified | 0 |
| Contextual RNN-GANs for Abstract Reasoning Diagram Generation | Sep 29, 2016 | Generative Adversarial Network, Video Generation | —Unverified | 0 |
| FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset | Sep 23, 2024 | Image Generation, Unconditional Video Generation | —Unverified | 0 |
| FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability | Dec 6, 2023 | Face Model, Video Generation | —Unverified | 0 |
| Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation | Feb 11, 2025 | Gesture Generation, Video Generation | —Unverified | 0 |
| Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation | Aug 29, 2024 | All, Video Generation | —Unverified | 0 |
| Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis | Apr 30, 2025 | Disparity Estimation, Transparent objects | —Unverified | 0 |
| Context-aware Talking Face Video Generation | Feb 28, 2024 | Video Generation, Video Synchronization | —Unverified | 0 |
| AtomoVideo: High Fidelity Image-to-Video Generation | Mar 4, 2024 | Image Generation, Image to Video Generation | —Unverified | 0 |
| Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method | May 7, 2024 | Video Generation | —Unverified | 0 |
| ContentV: Efficient Training of Video Generation Models with Limited Compute | Jun 5, 2025 | Image Generation, Video Generation | —Unverified | 0 |
| AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Mar 26, 2025 | Autonomous Driving, NeRF | —Unverified | 0 |
| Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training | Dec 8, 2024 | Video Generation | —Unverified | 0 |
| Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey | Nov 5, 2024 | 3D Scene Reconstruction, Autonomous Driving | —Unverified | 0 |
| Exploring the Hyperparameter Space of Image Diffusion Models for Echocardiogram Generation | Nov 2, 2023 | Video Generation | —Unverified | 0 |
| ATI: Any Trajectory Instruction for Controllable Video Generation | May 28, 2025 | Image to Video Generation, Video Generation | —Unverified | 0 |
| Explorative Inbetweening of Time and Space | Mar 21, 2024 | Denoising, Video Generation | —Unverified | 0 |
| Consistent Zero-shot 3D Texture Synthesis Using Geometry-aware Diffusion and Temporal Video Models | Jun 26, 2025 | Texture Synthesis, Video Generation | —Unverified | 0 |
| Explaining Vision and Language through Graphs of Events in Space and Time | Aug 29, 2023 | Graph Matching, Video Generation | —Unverified | 0 |
| Every Smile is Unique: Landmark-Guided Diverse Smile Generation | Feb 6, 2018 | Video Generation | —Unverified | 0 |
| AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations | Oct 17, 2024 | Decoder, Quantization | —Unverified | 0 |
| 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | Oct 21, 2024 | 3DGS, Decoder | —Unverified | 0 |
| Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation | Apr 17, 2023 | Image Generation, Super-Resolution | —Unverified | 0 |
| Every Image Listens, Every Image Dances: Music-Driven Image Animation | Jan 30, 2025 | Image Animation, Video Generation | —Unverified | 0 |
| Everybody Sign Now: Translating Spoken Language to Photo Realistic Sign Language Video | Nov 19, 2020 | Sign Language Production, Video Generation | —Unverified | 0 |
| CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion | Jun 7, 2024 | Scheduling, Video Generation | —Unverified | 0 |
| Event-based High Dynamic Range Image and Very High Frame Rate Video Generation using Conditional Generative Adversarial Networks | Nov 20, 2018 | Video Generation, Vocal Bursts Intensity Prediction | —Unverified | 0 |
| ASurvey: Spatiotemporal Consistency in Video Generation | Feb 25, 2025 | Image Generation, Video Generation | —Unverified | 0 |