SOTAVerified

Video Editing

Papers

Showing 301–346 of 346 papers

Title | Status | Hype
The ALOS Dataset for Advert Localization in Outdoor Scenes |  | 0
The Curious Case of End Token: A Zero-Shot Disentangled Image Editing using CLIP |  | 0
Towards Consistent Video Editing with Text-to-Image Diffusion Models |  | 0
Towards Data-Driven Automatic Video Editing |  | 0
Training-Free Robust Interactive Video Object Segmentation |  | 0
Trajectory Attention for Fine-grained Video Motion Control |  | 0
Transformer-based Image and Video Inpainting: Current Challenges and Future Directions |  | 0
Understanding Attention Mechanism in Video Diffusion Models |  | 0
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing |  | 0
UniFaceGAN: A Unified Framework for Temporally Consistent Facial Video Editing |  | 0
Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection |  | 0
Unity in Diversity: Video Editing via Gradient-Latent Purification |  | 0
Unsupervised Facial Performance Editing via Vector-Quantized StyleGAN Representations |  | 0
UVCG: Leveraging Temporal Consistency for Universal Video Protection |  | 0
UVL2: A Unified Framework for Video Tampering Localization |  | 0
V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection |  | 0
V2Edit: Versatile Video Diffusion Editor for Videos and 3D Scenes |  | 0
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos |  | 0
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation |  | 0
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Code | 0
Face Mask Removal with Region-attentive Face Inpainting | Code | 0
Movie Genre Classification by Language Augmentation and Shot Sampling | Code | 0
Edit Temporal-Consistent Videos with Image Diffusion Model | Code | 0
Shot Sequence Ordering for Video Editing: Benchmarks, Metrics, and Cinematology-Inspired Computing Methods | Code | 0
Video Editing for Audio-Visual Dubbing | Code | 0
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models | Code | 0
EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing Models | Code | 0
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation | Code | 0
ReferDINO-Plus: 2nd Solution for 4th PVUW MeViS Challenge at CVPR 2025 | Code | 0
Free-form Video Inpainting with 3D Gated Convolution and Temporal PatchGAN | Code | 0
Detecting Kissing Scenes in a Database of Hollywood Films | Code | 0
Causally Steered Diffusion for Automated Video Counterfactual Generation | Code | 0
A Temporally-Aware Interpolation Network for Video Frame Inpainting | Code | 0
Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior | Code | 0
Point-to-Point Video Generation | Code | 0
Fast Deep Matting for Portrait Animation on Mobile Phone | Code | 0
OpenKinoAI: An Open Source Framework for Intelligent Cinematography and Editing of Live Performances | Code | 0
Language-Driven Interactive Shadow Detection | Code | 0
APES: Audiovisual Person Search in Untrimmed Video | Code | 0
BodyNet: Volumetric Inference of 3D Human Body Shapes | Code | 0
ICface: Interpretable and Controllable Face Reenactment Using GANs | Code | 0
Cross-modal Cognitive Consensus guided Audio-Visual Segmentation | Code | 0
Benchmarking the Robustness of Optical Flow Estimation to Corruptions | Code | 0
Modelling Latent Dynamics of StyleGAN using Neural ODEs | Code | 0
Efficient Neural Network Encoding for 3D Color Lookup Tables | Code | 0
Video Acceleration Magnification | Code | 0

No leaderboard results yet.