| CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers | Feb 10, 2025 | Image GenerationVideo Generation | —Unverified | 0 |
| Anchored Diffusion for Video Face Reenactment | Jul 21, 2024 | Face ReenactmentVideo Generation | —Unverified | 0 |
| CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | Jan 18, 2024 | ObjectText-to-Video Generation | —Unverified | 0 |
| Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models | Nov 26, 2024 | Reinforcement Learning (RL)Text-to-Video Generation | —Unverified | 0 |
| Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting | Nov 19, 2024 | 3D GenerationGPU | —Unverified | 0 |
| NewMove: Customizing text-to-video models with novel motions | Dec 7, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Framer: Interactive Frame Interpolation | Oct 24, 2024 | Image MorphingVideo Generation | —Unverified | 0 |
| Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Feb 22, 2024 | Video Generation | —Unverified | 0 |
| Frame by Familiar Frame: Understanding Replication in Video Diffusion Models | Mar 28, 2024 | Image GenerationVideo Generation | —Unverified | 0 |
| AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation | Nov 26, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation | Dec 10, 2024 | Video Generation | —Unverified | 0 |
| FrameBridge: Improving Image-to-Video Generation with Bridge Models | Oct 20, 2024 | Image AnimationImage to Video Generation | —Unverified | 0 |
| FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion | Jun 5, 2025 | DenoisingQuantization | —Unverified | 0 |
| Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals | May 26, 2025 | DiversityVideo Generation | —Unverified | 0 |
| Follow-Your-Pose v2: Multiple-Condition Guided Character Image Animation for Stable Pose Control | Jun 5, 2024 | Image AnimationVideo Generation | —Unverified | 0 |
| AutoLV: Automatic Lecture Video Generator | Sep 19, 2022 | Speech SynthesisTalking Head Generation | —Unverified | 0 |
| Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance | Dec 21, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes | May 30, 2025 | counterfactualVideo Generation | —Unverified | 0 |
| Follow-Your-Creation: Empowering 4D Creation through Video Inpainting | Jun 5, 2025 | Video GenerationVideo Inpainting | —Unverified | 0 |
| Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model | Apr 15, 2024 | GPUImage Generation | —Unverified | 0 |
| Autoencoding Video Latents for Adversarial Video Generation | Jan 18, 2022 | Image GenerationVideo Generation | —Unverified | 0 |
| FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video | Mar 6, 2025 | Future predictionNovel View Synthesis | —Unverified | 0 |
| FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax | Nov 27, 2023 | Video Generation | —Unverified | 0 |
| Cross-View Exocentric to Egocentric Video Synthesis | Jul 7, 2021 | Generative Adversarial NetworkVideo Generation | —Unverified | 0 |
| FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis | Feb 12, 2025 | Motion SynthesisOptical Flow Estimation | —Unverified | 0 |
| FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait | Dec 2, 2024 | Image AnimationVideo Generation | —Unverified | 0 |
| Cross-Modal Learning for Music-to-Music-Video Description Generation | Mar 14, 2025 | Video DescriptionVideo Generation | —Unverified | 0 |
| FLIP: Flow-Centric Generative Planning as General-Purpose Manipulation World Model | Dec 11, 2024 | Representation LearningVideo Generation | —Unverified | 0 |
| FlexLip: A Controllable Text-to-Lip System | Jun 7, 2022 | Audio Generationtext-to-speech | —Unverified | 0 |
| Fleximo: Towards Flexible Text-to-Human Motion Video Generation | Nov 29, 2024 | Image to Video GenerationLarge Language Model | —Unverified | 0 |
| CPA: Camera-pose-awareness Diffusion Transformer for Video Generation | Dec 2, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation | May 7, 2024 | Face GenerationTalking Face Generation | —Unverified | 0 |
| Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts | May 22, 2025 | Dialogue GenerationLarge Language Model | —Unverified | 0 |
| InfinityDrive: Breaking Time Limits in Driving World Models | Dec 2, 2024 | Autonomous DrivingDiversity | —Unverified | 0 |
| Instructional Video Generation | Dec 5, 2024 | Video Generation | —Unverified | 0 |
| FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute | Feb 27, 2025 | DenoisingImage Generation | —Unverified | 0 |
| Co-Speech Gesture Video Generation with Implicit Motion-Audio Entanglement | Jan 1, 2025 | Gesture GenerationMotion Generation | —Unverified | 0 |
| FlexCache: Flexible Approximate Cache System for Video Diffusion | Dec 18, 2024 | Video Generation | —Unverified | 0 |
| Audio-Sync Video Generation with Multi-Stream Temporal Control | Jun 9, 2025 | Audio-Visual SynchronizationVideo Alignment | —Unverified | 0 |
| FlashVideo: A Framework for Swift Inference in Text-to-Video Generation | Dec 30, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Fisher Flow Matching for Generative Modeling over Discrete Data | May 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Copy Motion From One to Another: Fake Motion Video Generation | May 3, 2022 | Video Generation | —Unverified | 0 |
| Audio-Driven Talking Face Video Generation with Dynamic Convolution Kernels | Jan 16, 2022 | Video Generation | —Unverified | 0 |
| FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos | Apr 14, 2025 | Video Generation | —Unverified | 0 |
| FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance | May 19, 2025 | Action GenerationHuman action generation | —Unverified | 0 |
| Fine-grained Controllable Video Generation via Object Appearance and Context | Dec 5, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| Convergence of Diffusion Models Under the Manifold Hypothesis in High-Dimensions | Sep 27, 2024 | DenoisingGaussian Processes | —Unverified | 0 |
| Fine-gained Zero-shot Video Sampling | Jul 31, 2024 | Image GenerationVideo Editing | —Unverified | 0 |
| Controllable Video Generation With Sparse Trajectories | Jun 1, 2018 | Video GenerationVideo Prediction | —Unverified | 0 |
| Audio-driven Gesture Generation via Deviation Feature in the Latent Space | Mar 27, 2025 | Gesture GenerationVideo Generation | —Unverified | 0 |