| FADE: Frequency-Aware Diffusion Model Factorization for Video Editing | Jun 6, 2025 | Video Editing | CodeCode Available | 1 |
| Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing | May 29, 2025 | Optical Flow EstimationVideo Editing | CodeCode Available | 1 |
| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot LearningInteractive Segmentation | CodeCode Available | 1 |
| InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction | Mar 26, 2025 | Instruction FollowingVideo Editing | CodeCode Available | 1 |
| Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion | Jan 8, 2025 | Video Editing | CodeCode Available | 1 |
| DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Dec 27, 2024 | Autonomous DrivingNovel View Synthesis | CodeCode Available | 1 |
| Re-Attentional Controllable Video Diffusion Editing | Dec 16, 2024 | DenoisingVideo Editing | CodeCode Available | 1 |
| OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models | Nov 15, 2024 | Optical Flow EstimationText-to-Video Generation | CodeCode Available | 1 |
| Rethinking the Architecture Design for Efficient Generic Event Boundary Detection | Jul 17, 2024 | Boundary DetectionGeneric Event Boundary Detection | CodeCode Available | 1 |
| MVOC: a training-free multiple video object composition method with diffusion models | Jun 22, 2024 | Image to Video GenerationObject | CodeCode Available | 1 |
| COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing | Jun 13, 2024 | DenoisingGPU | CodeCode Available | 1 |
| Streaming Video Diffusion: Online Video Editing with Diffusion Models | May 30, 2024 | Video Editing | CodeCode Available | 1 |
| RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives | May 28, 2024 | AttributeVideo Editing | CodeCode Available | 1 |
| EffiVED:Efficient Video Editing via Text-instruction Diffusion Models | Mar 18, 2024 | Video Editing | CodeCode Available | 1 |
| AICL: Action In-Context Learning for Video Diffusion Model | Mar 18, 2024 | Action GenerationIn-Context Learning | CodeCode Available | 1 |
| FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF | Jan 5, 2024 | NeRFVideo Editing | CodeCode Available | 1 |
| Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions | Jan 3, 2024 | Image AnimationVideo Editing | CodeCode Available | 1 |
| Implicit Motion Function | Jan 1, 2024 | DecoderOptical Flow Estimation | CodeCode Available | 1 |
| A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing | Dec 10, 2023 | Video Editing | CodeCode Available | 1 |
| MotionCrafter: One-Shot Motion Customization of Diffusion Models | Dec 8, 2023 | DisentanglementMotion Disentanglement | CodeCode Available | 1 |
| MagicStick: Controllable Video Editing via Control Handle Transformations | Dec 5, 2023 | Video EditingVideo Generation | CodeCode Available | 1 |
| BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models | Dec 5, 2023 | Image GenerationModel Selection | CodeCode Available | 1 |
| DragVideo: Interactive Drag-style Video Editing | Dec 3, 2023 | Video EditingVideo Generation | CodeCode Available | 1 |
| VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models | Dec 1, 2023 | Video EditingVideo Generation | CodeCode Available | 1 |
| MotionEditor: Editing Video Motion via Content-Aware Diffusion | Nov 30, 2023 | Video Editing | CodeCode Available | 1 |
| MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing | Nov 29, 2023 | DenoisingImage to Video Generation | CodeCode Available | 1 |
| Consistent Video-to-Video Transfer Using Synthetic Dataset | Nov 1, 2023 | Video Editing | CodeCode Available | 1 |
| CVPR 2023 Text Guided Video Editing Competition | Oct 24, 2023 | Video EditingVideo Generation | CodeCode Available | 1 |
| LOVECon: Text-driven Training-Free Long Video Editing with ControlNet | Oct 15, 2023 | Style TransferVideo Editing | CodeCode Available | 1 |
| Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models | Oct 2, 2023 | AttributeOptical Flow Estimation | CodeCode Available | 1 |
| CCEdit: Creative and Controllable Video Editing via Diffusion Models | Sep 28, 2023 | Image GenerationText-to-Image Generation | CodeCode Available | 1 |
| 1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation | Aug 28, 2023 | Autonomous DrivingDenoising | CodeCode Available | 1 |
| 1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation | Jun 7, 2023 | Autonomous DrivingPanoptic Segmentation | CodeCode Available | 1 |
| DVIS: Decoupled Video Instance Segmentation Framework | Jun 6, 2023 | Autonomous DrivingGPU | CodeCode Available | 1 |
| dugMatting: Decomposed-Uncertainty-Guided Matting | Jun 2, 2023 | Image MattingVideo Editing | CodeCode Available | 1 |
| FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models | Jun 1, 2023 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 1 |
| SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing | May 30, 2023 | Style TransferVideo Editing | CodeCode Available | 1 |
| A Dual-level Detection Method for Video Copy Detection | May 21, 2023 | Copy DetectionPartial Video Copy Detection | CodeCode Available | 1 |
| LEO: Generative Latent Image Animator for Human Video Synthesis | May 6, 2023 | DisentanglementVideo Editing | CodeCode Available | 1 |
| Soundini: Sound-Guided Diffusion for Natural Video Editing | Apr 13, 2023 | DenoisingOptical Flow Estimation | CodeCode Available | 1 |
| VIVE3D: Viewpoint-Independent Video Editing using 3D-Aware GANs | Mar 28, 2023 | Optical Flow EstimationVideo Editing | CodeCode Available | 1 |
| Pix2Video: Video Editing using Image Diffusion | Mar 22, 2023 | DenoisingText Generation | CodeCode Available | 1 |
| A Light Weight Model for Active Speaker Detection | Mar 8, 2023 | Active Speaker DetectionAudio-Visual Active Speaker Detection | CodeCode Available | 1 |
| Combating Online Misinformation Videos: Characterization, Detection, and Future Directions | Feb 7, 2023 | MisinformationRecommendation Systems | CodeCode Available | 1 |
| Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward | Sep 25, 2022 | DecoderVideo Editing | CodeCode Available | 1 |
| AutoTransition: Learning to Recommend Video Transition Effects | Jul 27, 2022 | RetrievalVideo Editing | CodeCode Available | 1 |
| The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing | Jul 20, 2022 | AnatomyVideo Editing | CodeCode Available | 1 |
| Patch-based Object-centric Transformers for Efficient Video Generation | Jun 8, 2022 | ObjectVideo Editing | CodeCode Available | 1 |
| Deformable Sprites for Unsupervised Video Decomposition | Apr 14, 2022 | Video Editing | CodeCode Available | 1 |
| Transcoded Video Restoration by Temporal Spatial Auxiliary Network | Dec 15, 2021 | Video EditingVideo Restoration | CodeCode Available | 1 |