| Sound-Guided Semantic Video Generation | Apr 20, 2022 | Video Editing, Video Generation | —Unverified | 0 |
| Soundify: Matching Sound Effects to Video | Dec 17, 2021 | Audio Generation, Image Classification | —Unverified | 0 |
| Spatio-temporal Action Recognition: A Survey | Jan 27, 2019 | Action Detection, Action Localization | —Unverified | 0 |
| Speech Driven Video Editing via an Audio-Conditioned Diffusion Model | Jan 10, 2023 | Denoising, Face Model | —Unverified | 0 |
| Speech Prediction in Silent Videos using Variational Autoencoders | Nov 14, 2020 | Prediction, Video Editing | —Unverified | 0 |
| SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation | May 25, 2025 | Video Editing, Video Generation | —Unverified | 0 |
| SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields | Dec 7, 2022 | Video Editing | —Unverified | 0 |
| ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs | Nov 30, 2023 | Image Generation, NeRF | —Unverified | 0 |
| 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Jun 12, 2024 | Instance Segmentation, Semantic Segmentation | —Unverified | 0 |
| 2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation | Jun 1, 2024 | Autonomous Driving, Panoptic Segmentation | —Unverified | 0 |
| Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions | Mar 11, 2024 | counterfactual, Video Editing | —Unverified | 0 |
| A Deep Multiscale Framework for Video Watermarking | Mar 28, 2023 | Video Editing | —Unverified | 0 |
| Adversarial Attacks on Video Object Segmentation with Hard Region Discovery | Sep 25, 2023 | Autonomous Driving, Object | —Unverified | 0 |
| AI based approach to Trailer Generation for Online Educational Courses | Jan 10, 2023 | Video Editing | —Unverified | 0 |
| Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space | Jan 1, 2025 | Image-to-Image Translation, Video Editing | —Unverified | 0 |
| Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing | Jan 1, 2025 | Denoising, Video Editing | —Unverified | 0 |
| Analysis of Attention in Video Diffusion Transformers | Apr 14, 2025 | Video Editing | —Unverified | 0 |
| AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment | Apr 7, 2024 | Video Editing, Video Generation | —Unverified | 0 |
| Anything in Any Scene: Photorealistic Video Object Insertion | Jan 30, 2024 | Data Augmentation, Object | —Unverified | 0 |
| A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model | Nov 7, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories | May 2, 2025 | Code Generation, Text Generation | —Unverified | 0 |
| Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN | Jul 8, 2024 | Disentanglement, Video Editing | —Unverified | 0 |
| Audio Match Cutting: Finding and Creating Matching Audio Transitions in Movies and Videos | Aug 20, 2024 | Video Editing | —Unverified | 0 |
| AutoMatch: A Large-scale Audio Beat Matching Benchmark for Boosting Deep Learning Assistant Video Editing | Mar 3, 2023 | Video Editing | —Unverified | 0 |
| Automatically Extract the Semi-transparent Motion-blurred Hand from a Single Image | Jun 27, 2019 | Decoder, Video Editing | —Unverified | 0 |
| Automatic Curation of Golf Highlights using Multimodal Excitement Features | Jul 22, 2017 | Action Recognition, Retrieval | —Unverified | 0 |
| Automatic Non-Linear Video Editing Transfer | May 14, 2021 | Video Editing | —Unverified | 0 |
| Blended Latent Diffusion under Attention Control for Real-World Video Editing | Sep 5, 2024 | Image Generation, Text to Image Generation | —Unverified | 0 |
| B-Script: Transcript-based B-roll Video Editing with Recommendations | Feb 28, 2019 | Video Editing | —Unverified | 0 |
| Calipso: Physics-based Image and Video Editing through CAD Model Proxies | Aug 12, 2017 | Video Editing | —Unverified | 0 |
| CAMEL: CAusal Motion Enhancement Tailored for Lifting Text-driven Video Editing | Jan 1, 2024 | Video Editing | —Unverified | 0 |
| CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models | Apr 13, 2025 | Video Editing, Video Generation | —Unverified | 0 |
| Make VLM Recognize Visual Hallucination on Cartoon Character Image with Pose Information | Mar 22, 2024 | 3D Reconstruction, Hallucination | —Unverified | 0 |
| Clarification of Video Retrieval Query Results by the Automated Insertion of Supporting Shots | Feb 19, 2021 | Retrieval, Video Editing | —Unverified | 0 |
| Consistent and Controllable Image Animation with Motion Diffusion Models | Jan 1, 2025 | Image Animation, Video Editing | —Unverified | 0 |
| Consistent Depth of Moving Objects in Video | Aug 2, 2021 | Depth Estimation, Depth Prediction | —Unverified | 0 |
| Controllable Weather Synthesis and Removal with Video Diffusion Models | May 1, 2025 | Video Editing | —Unverified | 0 |
| Counteracting temporal attacks in Video Copy Detection | Jan 19, 2025 | Copy Detection, Video Editing | —Unverified | 0 |
| CSS-Segment: 2nd Place Report of LSVOS Challenge VOS Track | Aug 24, 2024 | Autonomous Driving, Object | —Unverified | 0 |
| Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model | Apr 15, 2024 | GPU, Image Generation | —Unverified | 0 |
| Cut-and-Paste: Subject-Driven Video Editing with Attention Control | Nov 20, 2023 | Object, Video Editing | —Unverified | 0 |
| Dance In the Wild: Monocular Human Animation with Neural Dynamic Appearance Synthesis | Nov 10, 2021 | Human Animation, motion retargeting | —Unverified | 0 |
| DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models | May 11, 2025 | parameter-efficient fine-tuning, Video Alignment | —Unverified | 0 |
| DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency | Aug 14, 2024 | text-guided-image-editing, Video Editing | —Unverified | 0 |
| Designing a 3D-Aware StyleNeRF Encoder for Face Editing | Feb 19, 2023 | Attribute, Face Model | —Unverified | 0 |
| DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing | Jun 26, 2025 | Video Editing, Video Generation | —Unverified | 0 |
| DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing | Dec 5, 2023 | Object, Video Editing | —Unverified | 0 |
| Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding | Dec 6, 2022 | Video Editing | —Unverified | 0 |
| Disentangled Multidimensional Metric Learning for Music Similarity | Aug 9, 2020 | Metric Learning, Specificity | —Unverified | 0 |
| DIVE: Taming DINO for Subject-Driven Video Editing | Dec 4, 2024 | Image Generation, Video Editing | —Unverified | 0 |