| Endora: Video Generation Models as Endoscopy Simulators | Mar 17, 2024 | Data AugmentationVideo Generation | —Unverified | 0 |
| Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation | Dec 12, 2024 | Video Generation | —Unverified | 0 |
| Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory | Dec 23, 2024 | Video Generation | —Unverified | 0 |
| EQ-TAA: Equivariant Traffic Accident Anticipation via Diffusion-Based Accident Video Synthesis | Mar 16, 2025 | Accident AnticipationVideo Generation | —Unverified | 0 |
| EVA: An Embodied World Model for Future Video Anticipation | Oct 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Robot Policies in a World Model | May 31, 2025 | modelVideo Generation | —Unverified | 0 |
| EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation | Mar 24, 2025 | BenchmarkingData Augmentation | —Unverified | 0 |
| Event-based High Dynamic Range Image and Very High Frame Rate Video Generation using Conditional Generative Adversarial Networks | Nov 20, 2018 | Video GenerationVocal Bursts Intensity Prediction | —Unverified | 0 |
| Everybody Sign Now: Translating Spoken Language to Photo Realistic Sign Language Video | Nov 19, 2020 | Sign Language ProductionVideo Generation | —Unverified | 0 |
| Every Image Listens, Every Image Dances: Music-Driven Image Animation | Jan 30, 2025 | Image AnimationVideo Generation | —Unverified | 0 |
| Every Smile is Unique: Landmark-Guided Diverse Smile Generation | Feb 6, 2018 | Video Generation | —Unverified | 0 |
| Explaining Vision and Language through Graphs of Events in Space and Time | Aug 29, 2023 | Graph MatchingVideo Generation | —Unverified | 0 |
| Explorative Inbetweening of Time and Space | Mar 21, 2024 | DenoisingVideo Generation | —Unverified | 0 |
| Exploring the Hyperparameter Space of Image Diffusion Models for Echocardiogram Generation | Nov 2, 2023 | Video Generation | —Unverified | 0 |
| Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey | Nov 5, 2024 | 3D Scene ReconstructionAutonomous Driving | —Unverified | 0 |
| Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method | May 7, 2024 | Video Generation | —Unverified | 0 |
| Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis | Apr 30, 2025 | Disparity EstimationTransparent objects | —Unverified | 0 |
| FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability | Dec 6, 2023 | Face ModelVideo Generation | —Unverified | 0 |
| FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset | Sep 23, 2024 | Image GenerationUnconditional Video Generation | —Unverified | 0 |
| Face Video Generation from a Single Image and Landmarks | Apr 25, 2019 | Image-to-Image TranslationTranslation | —Unverified | 0 |
| Facial Expression Video Generation Based-On Spatio-temporal Convolutional GAN: FEV-GAN | Oct 20, 2022 | Facial expression generationVideo Generation | —Unverified | 0 |
| Fashion-VDM: Video Diffusion Model for Virtual Try-On | Oct 31, 2024 | Video GenerationVirtual Try-on | —Unverified | 0 |
| Fast Autoregressive Video Generation with Diagonal Decoding | Mar 18, 2025 | Video Generation | —Unverified | 0 |
| FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality | Oct 25, 2024 | Video Generation | —Unverified | 0 |
| Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions | Jul 27, 2024 | Computational EfficiencyVideo Generation | —Unverified | 0 |
| FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing | Mar 10, 2024 | Image GenerationText-to-Video Editing | —Unverified | 0 |
| FFA Sora, video generation as fundus fluorescein angiography simulator | Dec 23, 2024 | Privacy PreservingQuestion Answering | —Unverified | 0 |
| FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation | Jul 9, 2025 | DescriptiveText Generation | —Unverified | 0 |
| Fine-gained Zero-shot Video Sampling | Jul 31, 2024 | Image GenerationVideo Editing | —Unverified | 0 |
| Fine-grained Controllable Video Generation via Object Appearance and Context | Dec 5, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance | May 19, 2025 | Action GenerationHuman action generation | —Unverified | 0 |
| FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos | Apr 14, 2025 | Video Generation | —Unverified | 0 |
| Fisher Flow Matching for Generative Modeling over Discrete Data | May 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FlashVideo: A Framework for Swift Inference in Text-to-Video Generation | Dec 30, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| FlexCache: Flexible Approximate Cache System for Video Diffusion | Dec 18, 2024 | Video Generation | —Unverified | 0 |
| FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute | Feb 27, 2025 | DenoisingImage Generation | —Unverified | 0 |
| Fleximo: Towards Flexible Text-to-Human Motion Video Generation | Nov 29, 2024 | Image to Video GenerationLarge Language Model | —Unverified | 0 |
| FlexLip: A Controllable Text-to-Lip System | Jun 7, 2022 | Audio Generationtext-to-speech | —Unverified | 0 |
| FLIP: Flow-Centric Generative Planning as General-Purpose Manipulation World Model | Dec 11, 2024 | Representation LearningVideo Generation | —Unverified | 0 |
| FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait | Dec 2, 2024 | Image AnimationVideo Generation | —Unverified | 0 |
| FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis | Feb 12, 2025 | Motion SynthesisOptical Flow Estimation | —Unverified | 0 |
| FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax | Nov 27, 2023 | Video Generation | —Unverified | 0 |
| FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video | Mar 6, 2025 | Future predictionNovel View Synthesis | —Unverified | 0 |
| Follow-Your-Creation: Empowering 4D Creation through Video Inpainting | Jun 5, 2025 | Video GenerationVideo Inpainting | —Unverified | 0 |
| Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance | Dec 21, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Follow-Your-Pose v2: Multiple-Condition Guided Character Image Animation for Stable Pose Control | Jun 5, 2024 | Image AnimationVideo Generation | —Unverified | 0 |
| Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals | May 26, 2025 | DiversityVideo Generation | —Unverified | 0 |
| FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion | Jun 5, 2025 | DenoisingQuantization | —Unverified | 0 |
| FrameBridge: Improving Image-to-Video Generation with Bridge Models | Oct 20, 2024 | Image AnimationImage to Video Generation | —Unverified | 0 |
| Frame by Familiar Frame: Understanding Replication in Video Diffusion Models | Mar 28, 2024 | Image GenerationVideo Generation | —Unverified | 0 |