| MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation | Feb 6, 2025 | Image to Video Generation, Video Editing | Unverified | 0 |
| EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues | Feb 4, 2025 | Dialogue Interpretation, Dialogue Understanding | Unverified | 0 |
| Exploring Temporally-Aware Features for Point Tracking | Jan 21, 2025 | Point Tracking, Video Editing | Code Available | 2 |
| Counteracting temporal attacks in Video Copy Detection | Jan 19, 2025 | Copy Detection, Video Editing | Unverified | 0 |
| IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion | Jan 13, 2025 | Video Editing | Unverified | 0 |
| SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Jan 13, 2025 | Object, object-detection | Code Available | 0 |
| Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning | Jan 11, 2025 | Video Editing, Video Generation | Unverified | 0 |
| Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs | Jan 10, 2025 | Video Editing | Unverified | 0 |
| Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion | Jan 8, 2025 | Video Editing | Code Available | 1 |
| Edit as You See: Image-guided Video Editing via Masked Motion Modeling | Jan 8, 2025 | Optical Flow Estimation, Self-Supervised Learning | Unverified | 0 |
| JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing | Jan 3, 2025 | 3D Reconstruction, Face Generation | Code Available | 3 |
| Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing | Jan 1, 2025 | Denoising, Video Editing | Unverified | 0 |
| Consistent and Controllable Image Animation with Motion Diffusion Models | Jan 1, 2025 | Image Animation, Video Editing | Unverified | 0 |
| Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space | Jan 1, 2025 | Image-to-Image Translation, Video Editing | Unverified | 0 |
| VEU-Bench: Towards Comprehensive Understanding of Video Editing | Jan 1, 2025 | Video Editing, Video Understanding | Unverified | 0 |
| Unity in Diversity: Video Editing via Gradient-Latent Purification | Jan 1, 2025 | Diversity, Unity | Unverified | 0 |
| MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation | Dec 28, 2024 | Attribute, Computational Efficiency | Unverified | 0 |
| DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Dec 27, 2024 | Autonomous Driving, Novel View Synthesis | Code Available | 1 |
| DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Dec 24, 2024 | Video Editing, Video Generation | Code Available | 3 |
| Efficient Neural Network Encoding for 3D Color Lookup Tables | Dec 19, 2024 | Color Manipulation, Efficient Neural Network | Code Available | 0 |
| MotionBridge: Dynamic Video Inbetweening with Flexible Controls | Dec 17, 2024 | Video Editing, Video Generation | Unverified | 0 |
| MIVE: New Design and Benchmark for Multi-Instance Video Editing | Dec 17, 2024 | Video Editing | Unverified | 0 |
| Re-Attentional Controllable Video Diffusion Editing | Dec 16, 2024 | Denoising, Video Editing | Code Available | 1 |
| Video Seal: Open and Efficient Video Watermarking | Dec 12, 2024 | Video Compression, Video Editing | Code Available | 4 |
| Text-Video Multi-Grained Integration for Video Moment Montage | Dec 12, 2024 | Sentence, Video Editing | Unverified | 0 |
| MoViE: Mobile Diffusion for Video Editing | Dec 9, 2024 | Video Editing | Unverified | 0 |
| Video Decomposition Prior: A Methodology to Decompose Videos into Layers | Dec 6, 2024 | Semantic Segmentation, Video Editing | Unverified | 0 |
| Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges | Dec 4, 2024 | Code Generation, Image Comprehension | Unverified | 0 |
| DIVE: Taming DINO for Subject-Driven Video Editing | Dec 4, 2024 | Image Generation, Video Editing | Unverified | 0 |
| OmniCreator: Self-Supervised Unified Generation with Universal Editing | Dec 3, 2024 | Denoising, Semantic correspondence | Unverified | 0 |
| Trajectory Attention for Fine-grained Video Motion Control | Nov 28, 2024 | Inductive Bias, Video Editing | Unverified | 0 |
| VideoDirector: Precise Video Editing via Text-to-Video Models | Nov 26, 2024 | Attribute, Video Editing | Unverified | 0 |
| UVCG: Leveraging Temporal Consistency for Universal Video Protection | Nov 25, 2024 | Computational Efficiency, Video Editing | Unverified | 0 |
| VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing | Nov 22, 2024 | Video Editing | Unverified | 0 |
| Benchmarking the Robustness of Optical Flow Estimation to Corruptions | Nov 22, 2024 | Autonomous Driving, Benchmarking | Code Available | 0 |
| StableV2V: Stablizing Shape Consistency in Video-to-Video Editing | Nov 17, 2024 | Video Editing | Code Available | 2 |
| OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models | Nov 15, 2024 | Optical Flow Estimation, Text-to-Video Generation | Code Available | 1 |
| A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model | Nov 7, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Taming Rectified Flow for Inversion and Editing | Nov 7, 2024 | Image Generation, Text-to-Image Generation | Code Available | 4 |
| AutoVFX: Physically Realistic Video Editing from Natural Language Instructions | Nov 4, 2024 | Code Generation, Video Editing | Code Available | 3 |
| UniVST: A Unified Framework for Training-free Localized Video Style Transfer | Oct 26, 2024 | Style Transfer, Video Editing | Code Available | 2 |
| Movie Gen: A Cast of Media Foundation Models | Oct 17, 2024 | Audio Generation, Video Editing | Code Available | 3 |
| Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing | Oct 16, 2024 | Video Editing, Word Embeddings | Unverified | 0 |
| RNA: Video Editing with ROI-based Neural Atlas | Oct 10, 2024 | Video Editing, Video Reconstruction | Unverified | 0 |
| FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing | Sep 30, 2024 | Denoising, Video Editing | Unverified | 0 |
| Portrait Video Editing Empowered by Multimodal Generative Priors | Sep 20, 2024 | Video Editing | Unverified | 0 |
| EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing Models | Sep 15, 2024 | Video Editing | Code Available | 0 |
| Face Mask Removal with Region-attentive Face Inpainting | Sep 10, 2024 | Face Recognition, Facial Inpainting | Code Available | 0 |
| Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation | Sep 7, 2024 | Image Generation, Layout-to-Image Generation | Code Available | 0 |
| Blended Latent Diffusion under Attention Control for Real-World Video Editing | Sep 5, 2024 | Image Generation, Text to Image Generation | Unverified | 0 |