| SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Jan 13, 2025 | Object, object-detection | Code Available | 0 |
| IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion | Jan 13, 2025 | Video Editing | Unverified | 0 |
| Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning | Jan 11, 2025 | Video Editing, Video Generation | Unverified | 0 |
| Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs | Jan 10, 2025 | Video Editing | Unverified | 0 |
| Edit as You See: Image-guided Video Editing via Masked Motion Modeling | Jan 8, 2025 | Optical Flow Estimation, Self-Supervised Learning | Unverified | 0 |
| Unity in Diversity: Video Editing via Gradient-Latent Purification | Jan 1, 2025 | Diversity, Unity | Unverified | 0 |
| Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space | Jan 1, 2025 | Image-to-Image Translation, Video Editing | Unverified | 0 |
| Consistent and Controllable Image Animation with Motion Diffusion Models | Jan 1, 2025 | Image Animation, Video Editing | Unverified | 0 |
| VEU-Bench: Towards Comprehensive Understanding of Video Editing | Jan 1, 2025 | Video Editing, Video Understanding | Unverified | 0 |
| Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing | Jan 1, 2025 | Denoising, Video Editing | Unverified | 0 |
| MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation | Dec 28, 2024 | Attribute, Computational Efficiency | Unverified | 0 |
| Efficient Neural Network Encoding for 3D Color Lookup Tables | Dec 19, 2024 | Color Manipulation, Efficient Neural Network | Code Available | 0 |
| MotionBridge: Dynamic Video Inbetweening with Flexible Controls | Dec 17, 2024 | Video Editing, Video Generation | Unverified | 0 |
| MIVE: New Design and Benchmark for Multi-Instance Video Editing | Dec 17, 2024 | Video Editing | Unverified | 0 |
| Text-Video Multi-Grained Integration for Video Moment Montage | Dec 12, 2024 | Sentence, Video Editing | Unverified | 0 |
| MoViE: Mobile Diffusion for Video Editing | Dec 9, 2024 | Video Editing | Unverified | 0 |
| Video Decomposition Prior: A Methodology to Decompose Videos into Layers | Dec 6, 2024 | Semantic Segmentation, Video Editing | Unverified | 0 |
| Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges | Dec 4, 2024 | Code Generation, Image Comprehension | Unverified | 0 |
| DIVE: Taming DINO for Subject-Driven Video Editing | Dec 4, 2024 | Image Generation, Video Editing | Unverified | 0 |
| OmniCreator: Self-Supervised Unified Generation with Universal Editing | Dec 3, 2024 | Denoising, Semantic correspondence | Unverified | 0 |
| Trajectory Attention for Fine-grained Video Motion Control | Nov 28, 2024 | Inductive Bias, Video Editing | Unverified | 0 |
| VideoDirector: Precise Video Editing via Text-to-Video Models | Nov 26, 2024 | Attribute, Video Editing | Unverified | 0 |
| UVCG: Leveraging Temporal Consistency for Universal Video Protection | Nov 25, 2024 | Computational Efficiency, Video Editing | Unverified | 0 |
| VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing | Nov 22, 2024 | Video Editing | Unverified | 0 |
| Benchmarking the Robustness of Optical Flow Estimation to Corruptions | Nov 22, 2024 | Autonomous Driving, Benchmarking | Code Available | 0 |
| A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model | Nov 7, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing | Oct 16, 2024 | Video Editing, Word Embeddings | Unverified | 0 |
| RNA: Video Editing with ROI-based Neural Atlas | Oct 10, 2024 | Video Editing, Video Reconstruction | Unverified | 0 |
| FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing | Sep 30, 2024 | Denoising, Video Editing | Unverified | 0 |
| Portrait Video Editing Empowered by Multimodal Generative Priors | Sep 20, 2024 | Video Editing | Unverified | 0 |
| EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing Models | Sep 15, 2024 | Video Editing | Code Available | 0 |
| Face Mask Removal with Region-attentive Face Inpainting | Sep 10, 2024 | Face Recognition, Facial Inpainting | Code Available | 0 |
| Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation | Sep 7, 2024 | Image Generation, Layout-to-Image Generation | Code Available | 0 |
| Blended Latent Diffusion under Attention Control for Real-World Video Editing | Sep 5, 2024 | Image Generation, Text to Image Generation | Unverified | 0 |
| Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets | Sep 2, 2024 | Video Alignment, Video Editing | Unverified | 0 |
| CSS-Segment: 2nd Place Report of LSVOS Challenge VOS Track | Aug 24, 2024 | Autonomous Driving, Object | Unverified | 0 |
| Audio Match Cutting: Finding and Creating Matching Audio Transitions in Movies and Videos | Aug 20, 2024 | Video Editing | Unverified | 0 |
| Language-Driven Interactive Shadow Detection | Aug 16, 2024 | Descriptive, Shadow Detection | Code Available | 0 |
| DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency | Aug 14, 2024 | text-guided-image-editing, Video Editing | Unverified | 0 |
| Fine-gained Zero-shot Video Sampling | Jul 31, 2024 | Image Generation, Video Editing | Unverified | 0 |
| Text-based Talking Video Editing with Cascaded Conditional Diffusion | Jul 20, 2024 | Video Editing | Unverified | 0 |
| Multi-sentence Video Grounding for Long Video Generation | Jul 18, 2024 | Moment Retrieval, Retrieval | Unverified | 0 |
| InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models | Jul 15, 2024 | Object, Video Editing | Unverified | 0 |
| Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN | Jul 8, 2024 | Disentanglement, Video Editing | Unverified | 0 |
| Transformer-based Image and Video Inpainting: Current Challenges and Future Directions | Jun 28, 2024 | Image Inpainting, Video Editing | Unverified | 0 |
| V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data | Jun 20, 2024 | Attribute, Video Editing | Unverified | 0 |
| VIA: Unified Spatiotemporal Video Adaptation Framework for Global and Local Video Editing | Jun 18, 2024 | Video Editing | Unverified | 0 |
| VideoGUI: A Benchmark for GUI Automation from Instructional Videos | Jun 14, 2024 | Video Editing | Unverified | 0 |
| Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion | Jun 13, 2024 | Optical Flow Estimation, Video Editing | Unverified | 0 |
| 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Jun 12, 2024 | Instance Segmentation, Semantic Segmentation | Unverified | 0 |