SOTAVerified

Video Editing

Papers

Showing 251300 of 346 papers

TitleStatusHype
INVE: Interactive Neural Video Editing0
Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models0
InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models0
IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion0
LAMP: Learn A Motion Pattern for Few-Shot Video Generation0
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers0
LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing0
Learning 3D Particle-based Simulators from RGB-D Videos0
Learning the What and How of Annotation in Video Object Segmentation0
Let Your Video Listen to Your Music!0
Leveraging Frame Affinity for sRGB-to-RAW Video De-rendering0
LoRA-Edit: Controllable First-Frame-Guided Video Editing via Mask-Aware LoRA Fine-Tuning0
MagicEdit: High-Fidelity and Temporally Coherent Video Editing0
VEU-Bench: Towards Comprehensive Understanding of Video Editing0
VIA: Unified Spatiotemporal Video Adaptation Framework for Global and Local Video Editing0
VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing0
Video Decomposition Prior: A Methodology to Decompose Videos into Layers0
VideoDirector: Precise Video Editing via Text-to-Video Models0
Video Editing for Video Retrieval0
Video Editing via Factorized Diffusion Distillation0
Video Editing with Temporal, Spatial and Appearance Consistency0
Video Forgery Detection for Surveillance Cameras: A Review0
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing0
VideoGUI: A Benchmark for GUI Automation from Instructional Videos0
VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors0
Video Inpainting of Complex Scenes0
VideoMap: Supporting Video Editing Exploration, Brainstorming, and Prototyping in the Latent Space0
Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion0
VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing0
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence0
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models0
VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs0
Visual Prompting for One-shot Controllable Video Editing without Inversion0
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing0
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data0
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis0
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens0
Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising0
Zero-Shot Video Editing through Adaptive Sliding Score Distillation0
Zero-Shot Video Question Answering with Procedural Programs0
Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding0
Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges0
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets0
Task-agnostic Temporally Consistent Facial Video Editing0
TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs0
Temporally Consistent Semantic Video Editing0
Text-based Talking Video Editing with Cascaded Conditional Diffusion0
FacialFilmroll: High-resolution multi-shot video editing0
Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs0
Text-Video Multi-Grained Integration for Video Moment Montage0
Show:102550
← PrevPage 6 of 7Next →

No leaderboard results yet.