| Sound-Guided Semantic Video Generation | Apr 20, 2022 | Video EditingVideo Generation | —Unverified | 0 |
| Soundify: Matching Sound Effects to Video | Dec 17, 2021 | Audio GenerationImage Classification | —Unverified | 0 |
| Spatio-temporal Action Recognition: A Survey | Jan 27, 2019 | Action DetectionAction Localization | —Unverified | 0 |
| Speech Driven Video Editing via an Audio-Conditioned Diffusion Model | Jan 10, 2023 | DenoisingFace Model | —Unverified | 0 |
| Speech Prediction in Silent Videos using Variational Autoencoders | Nov 14, 2020 | PredictionVideo Editing | —Unverified | 0 |
| SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation | May 25, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields | Dec 7, 2022 | Video Editing | —Unverified | 0 |
| ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs | Nov 30, 2023 | Image GenerationNeRF | —Unverified | 0 |
| 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Jun 12, 2024 | Instance SegmentationSemantic Segmentation | —Unverified | 0 |
| 2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation | Jun 1, 2024 | Autonomous DrivingPanoptic Segmentation | —Unverified | 0 |
| Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions | Mar 11, 2024 | counterfactualVideo Editing | —Unverified | 0 |
| A Deep Multiscale Framework for Video Watermarking | Mar 28, 2023 | Video Editing | —Unverified | 0 |
| Adversarial Attacks on Video Object Segmentation with Hard Region Discovery | Sep 25, 2023 | Autonomous DrivingObject | —Unverified | 0 |
| AI based approach to Trailer Generation for Online Educational Courses | Jan 10, 2023 | Video Editing | —Unverified | 0 |
| Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space | Jan 1, 2025 | Image-to-Image TranslationVideo Editing | —Unverified | 0 |
| Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing | Jan 1, 2025 | DenoisingVideo Editing | —Unverified | 0 |
| Analysis of Attention in Video Diffusion Transformers | Apr 14, 2025 | Video Editing | —Unverified | 0 |
| AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment | Apr 7, 2024 | Video EditingVideo Generation | —Unverified | 0 |
| Anything in Any Scene: Photorealistic Video Object Insertion | Jan 30, 2024 | Data AugmentationObject | —Unverified | 0 |
| A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model | Nov 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories | May 2, 2025 | Code GenerationText Generation | —Unverified | 0 |
| Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN | Jul 8, 2024 | DisentanglementVideo Editing | —Unverified | 0 |
| Audio Match Cutting: Finding and Creating Matching Audio Transitions in Movies and Videos | Aug 20, 2024 | Video Editing | —Unverified | 0 |
| AutoMatch: A Large-scale Audio Beat Matching Benchmark for Boosting Deep Learning Assistant Video Editing | Mar 3, 2023 | Video Editing | —Unverified | 0 |
| Automatically Extract the Semi-transparent Motion-blurred Hand from a Single Image | Jun 27, 2019 | DecoderVideo Editing | —Unverified | 0 |