| Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding | Jun 9, 2025 | Contrastive LearningVideo Editing | —Unverified | 0 |
| Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges | Dec 4, 2024 | Code GenerationImage Comprehension | —Unverified | 0 |
| Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets | Sep 2, 2024 | Video AlignmentVideo Editing | —Unverified | 0 |
| Task-agnostic Temporally Consistent Facial Video Editing | Jul 3, 2020 | 3D ReconstructionVideo Editing | —Unverified | 0 |
| TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs | May 26, 2025 | BenchmarkingLarge Language Model | —Unverified | 0 |
| Temporally Consistent Semantic Video Editing | Jun 21, 2022 | Image GenerationVideo Editing | —Unverified | 0 |
| Text-based Talking Video Editing with Cascaded Conditional Diffusion | Jul 20, 2024 | Video Editing | —Unverified | 0 |
| FacialFilmroll: High-resolution multi-shot video editing | Oct 5, 2021 | Face ModelVideo Editing | —Unverified | 0 |
| Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs | Jan 10, 2025 | Video Editing | —Unverified | 0 |
| Text-Video Multi-Grained Integration for Video Moment Montage | Dec 12, 2024 | SentenceVideo Editing | —Unverified | 0 |