| RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping | Jun 10, 2025 | Video Editing | —Unverified | 0 |
| Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding | Jun 9, 2025 | Contrastive LearningVideo Editing | —Unverified | 0 |
| FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing | Jun 5, 2025 | Text-to-Video EditingVideo Editing | —Unverified | 0 |
| FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers | Jun 4, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| MiniMax-Remover: Taming Bad Noise Helps Video Object Removal | May 30, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| Video Editing for Audio-Visual Dubbing | May 29, 2025 | Video Editing | CodeCode Available | 0 |
| TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs | May 26, 2025 | BenchmarkingLarge Language Model | —Unverified | 0 |
| SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation | May 25, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From Shots to Stories: LLM-Assisted Video Editing with Unified Language Representations | May 18, 2025 | Video EditingVideo Understanding | —Unverified | 0 |