| LEO: Generative Latent Image Animator for Human Video Synthesis | May 6, 2023 | DisentanglementVideo Editing | CodeCode Available | 1 | 5 |
| Layered Neural Atlases for Consistent Video Editing | Sep 23, 2021 | Style TransferVideo Editing | CodeCode Available | 1 | 5 |
| LOVECon: Text-driven Training-Free Long Video Editing with ControlNet | Oct 15, 2023 | Style TransferVideo Editing | CodeCode Available | 1 | 5 |
| Implicit Motion Function | Jan 1, 2024 | DecoderOptical Flow Estimation | CodeCode Available | 1 | 5 |
| RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives | May 28, 2024 | AttributeVideo Editing | CodeCode Available | 1 | 5 |
| Re-Attentional Controllable Video Diffusion Editing | Dec 16, 2024 | DenoisingVideo Editing | CodeCode Available | 1 | 5 |
| COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing | Jun 13, 2024 | DenoisingGPU | CodeCode Available | 1 | 5 |
| InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction | Mar 26, 2025 | Instruction FollowingVideo Editing | CodeCode Available | 1 | 5 |
| Combating Online Misinformation Videos: Characterization, Detection, and Future Directions | Feb 7, 2023 | MisinformationRecommendation Systems | CodeCode Available | 1 | 5 |
| CVPR 2023 Text Guided Video Editing Competition | Oct 24, 2023 | Video EditingVideo Generation | CodeCode Available | 1 | 5 |
| AutoTransition: Learning to Recommend Video Transition Effects | Jul 27, 2022 | RetrievalVideo Editing | CodeCode Available | 1 | 5 |
| Learning to Cut by Watching Movies | Aug 9, 2021 | Contrastive LearningVideo Editing | CodeCode Available | 1 | 5 |
| OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models | Nov 15, 2024 | Optical Flow EstimationText-to-Video Generation | CodeCode Available | 1 | 5 |
| A Light Weight Model for Active Speaker Detection | Mar 8, 2023 | Active Speaker DetectionAudio-Visual Active Speaker Detection | CodeCode Available | 1 | 5 |
| Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models | Oct 2, 2023 | AttributeOptical Flow Estimation | CodeCode Available | 1 | 5 |
| Patch-based Object-centric Transformers for Efficient Video Generation | Jun 8, 2022 | ObjectVideo Editing | CodeCode Available | 1 | 5 |
| Consistent Video-to-Video Transfer Using Synthetic Dataset | Nov 1, 2023 | Video Editing | CodeCode Available | 1 | 5 |
| 1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation | Jun 7, 2023 | Autonomous DrivingPanoptic Segmentation | CodeCode Available | 1 | 5 |
| MagicStick: Controllable Video Editing via Control Handle Transformations | Dec 5, 2023 | Video EditingVideo Generation | CodeCode Available | 1 | 5 |
| Pix2Video: Video Editing using Image Diffusion | Mar 22, 2023 | DenoisingText Generation | CodeCode Available | 1 | 5 |
| Rethinking the Architecture Design for Efficient Generic Event Boundary Detection | Jul 17, 2024 | Boundary DetectionGeneric Event Boundary Detection | CodeCode Available | 1 | 5 |
| CCEdit: Creative and Controllable Video Editing via Diffusion Models | Sep 28, 2023 | Image GenerationText-to-Image Generation | CodeCode Available | 1 | 5 |
| FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models | Jun 1, 2023 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 1 | 5 |
| Feature Combination Meets Attention: Baidu Soccer Embeddings and Transformer based Temporal Detection | Jun 28, 2021 | Action RecognitionAction Spotting | CodeCode Available | 1 | 5 |
| FADE: Frequency-Aware Diffusion Model Factorization for Video Editing | Jun 6, 2025 | Video Editing | CodeCode Available | 1 | 5 |