SOTAVerified

Video Editing

Papers

Showing 151200 of 346 papers

TitleStatusHype
Sound-Guided Semantic Video Generation0
Soundify: Matching Sound Effects to Video0
Spatio-temporal Action Recognition: A Survey0
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model0
Speech Prediction in Silent Videos using Variational Autoencoders0
SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation0
SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields0
Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding0
Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges0
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets0
Task-agnostic Temporally Consistent Facial Video Editing0
TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs0
Temporally Consistent Semantic Video Editing0
Text-based Talking Video Editing with Cascaded Conditional Diffusion0
FacialFilmroll: High-resolution multi-shot video editing0
Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs0
Text-Video Multi-Grained Integration for Video Moment Montage0
The ALOS Dataset for Advert Localization in Outdoor Scenes0
The Curious Case of End Token: A Zero-Shot Disentangled Image Editing using CLIP0
Towards Consistent Video Editing with Text-to-Image Diffusion Models0
Towards Data-Driven Automatic Video Editing0
Training-Free Robust Interactive Video Object Segmentation0
Trajectory Attention for Fine-grained Video Motion Control0
Transformer-based Image and Video Inpainting: Current Challenges and Future Directions0
Understanding Attention Mechanism in Video Diffusion Models0
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing0
UniFaceGAN: A Unified Framework for Temporally Consistent Facial Video Editing0
Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection0
Unity in Diversity: Video Editing via Gradient-Latent Purification0
Unsupervised Facial Performance Editing via Vector-Quantized StyleGAN Representations0
UVCG: Leveraging Temporal Consistency for Universal Video Protection0
UVL2: A Unified Framework for Video Tampering Localization0
V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes0
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos0
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation0
VEU-Bench: Towards Comprehensive Understanding of Video Editing0
VIA: Unified Spatiotemporal Video Adaptation Framework for Global and Local Video Editing0
VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing0
Video Decomposition Prior: A Methodology to Decompose Videos into Layers0
VideoDirector: Precise Video Editing via Text-to-Video Models0
Video Editing for Video Retrieval0
Video Editing via Factorized Diffusion Distillation0
Video Editing with Temporal, Spatial and Appearance Consistency0
Video Forgery Detection for Surveillance Cameras: A Review0
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing0
VideoGUI: A Benchmark for GUI Automation from Instructional Videos0
VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors0
Video Inpainting of Complex Scenes0
VideoMap: Supporting Video Editing Exploration, Brainstorming, and Prototyping in the Latent Space0
Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion0
Show:102550
← PrevPage 4 of 7Next →

No leaderboard results yet.