| Zero-Shot Video Question Answering with Procedural Programs | Dec 1, 2023 | Code GenerationLanguage Modeling | —Unverified | 0 |
| VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models | Dec 1, 2023 | Video EditingVideo Generation | CodeCode Available | 1 |
| ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs | Nov 30, 2023 | Image GenerationNeRF | —Unverified | 0 |
| Motion-Conditioned Image Animation for Video Editing | Nov 30, 2023 | Image AnimationVideo Editing | —Unverified | 0 |
| MotionEditor: Editing Video Motion via Content-Aware Diffusion | Nov 30, 2023 | Video Editing | CodeCode Available | 1 |
| VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models | Nov 30, 2023 | Semantic SegmentationVideo Editing | —Unverified | 0 |
| MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing | Nov 29, 2023 | DenoisingImage to Video Generation | CodeCode Available | 1 |
| Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings | Nov 27, 2023 | Video Editing | —Unverified | 0 |
| Sketch Video Synthesis | Nov 26, 2023 | Video Editing | CodeCode Available | 2 |
| Highly Detailed and Temporal Consistent Video Stylization via Synchronized Multi-Frame Diffusion | Nov 24, 2023 | DenoisingOptical Flow Estimation | —Unverified | 0 |
| Cut-and-Paste: Subject-Driven Video Editing with Attention Control | Nov 20, 2023 | ObjectVideo Editing | —Unverified | 0 |
| Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation | Nov 14, 2023 | ObjectVideo Editing | CodeCode Available | 2 |
| Learning the What and How of Annotation in Video Object Segmentation | Nov 8, 2023 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Consistent Video-to-Video Transfer Using Synthetic Dataset | Nov 1, 2023 | Video Editing | CodeCode Available | 1 |
| Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models | Oct 25, 2023 | DenoisingVideo Editing | CodeCode Available | 0 |
| CVPR 2023 Text Guided Video Editing Competition | Oct 24, 2023 | Video EditingVideo Generation | CodeCode Available | 1 |
| LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation | Oct 16, 2023 | GPUImage Animation | CodeCode Available | 2 |
| DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing | Oct 16, 2023 | NeRFStyle Transfer | —Unverified | 0 |
| A Survey on Video Diffusion Models | Oct 16, 2023 | Image GenerationSurvey | CodeCode Available | 4 |
| LOVECon: Text-driven Training-Free Long Video Editing with ControlNet | Oct 15, 2023 | Style TransferVideo Editing | CodeCode Available | 1 |
| Cross-modal Cognitive Consensus guided Audio-Visual Segmentation | Oct 10, 2023 | ObjectSegmentation | CodeCode Available | 0 |
| FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing | Oct 9, 2023 | Optical Flow EstimationText-to-Video Editing | CodeCode Available | 2 |
| Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models | Oct 2, 2023 | AttributeOptical Flow Estimation | CodeCode Available | 1 |
| CCEdit: Creative and Controllable Video Editing via Diffusion Models | Sep 28, 2023 | Image GenerationText-to-Image Generation | CodeCode Available | 1 |
| UVL2: A Unified Framework for Video Tampering Localization | Sep 28, 2023 | Face SwappingVideo Editing | —Unverified | 0 |
| Hashing Neural Video Decomposition with Multiplicative Residuals in Space-Time | Sep 25, 2023 | GPUVideo Editing | —Unverified | 0 |
| Adversarial Attacks on Video Object Segmentation with Hard Region Discovery | Sep 25, 2023 | Autonomous DrivingObject | —Unverified | 0 |
| MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation | Sep 2, 2023 | Video Editing | —Unverified | 0 |
| 1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation | Aug 28, 2023 | Autonomous DrivingDenoising | CodeCode Available | 1 |
| MagicEdit: High-Fidelity and Temporally Coherent Video Editing | Aug 28, 2023 | TranslationVideo Editing | —Unverified | 0 |
| EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints | Aug 21, 2023 | Video Editing | —Unverified | 0 |
| StableVideo: Text-driven Consistency-aware Diffusion Video Editing | Aug 18, 2023 | Video Editing | CodeCode Available | 3 |
| SimDA: Simple Diffusion Adapter for Efficient Video Generation | Aug 18, 2023 | Super-ResolutionTransfer Learning | —Unverified | 0 |
| Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis | Aug 18, 2023 | Dynamic ReconstructionNovel View Synthesis | CodeCode Available | 4 |
| Edit Temporal-Consistent Videos with Image Diffusion Model | Aug 17, 2023 | modelVideo Editing | CodeCode Available | 0 |
| InFusion: Inject and Attention Fusion for Multi Concept Zero-Shot Text-based Video Editing | Jul 22, 2023 | DenoisingVideo Editing | —Unverified | 0 |
| TokenFlow: Consistent Diffusion Features for Consistent Video Editing | Jul 19, 2023 | Video Editing | CodeCode Available | 3 |
| INVE: Interactive Neural Video Editing | Jul 15, 2023 | Video Editing | —Unverified | 0 |
| VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing | Jun 14, 2023 | Image GenerationVideo Editing | —Unverified | 0 |
| 1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation | Jun 7, 2023 | Autonomous DrivingPanoptic Segmentation | CodeCode Available | 1 |
| DVIS: Decoupled Video Instance Segmentation Framework | Jun 6, 2023 | Autonomous DrivingGPU | CodeCode Available | 1 |
| dugMatting: Decomposed-Uncertainty-Guided Matting | Jun 2, 2023 | Image MattingVideo Editing | CodeCode Available | 1 |
| FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models | Jun 1, 2023 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 1 |
| SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing | May 30, 2023 | Style TransferVideo Editing | CodeCode Available | 1 |
| Towards Consistent Video Editing with Text-to-Image Diffusion Models | May 27, 2023 | One-Shot LearningVideo Editing | —Unverified | 0 |
| ControlVideo: Conditional Control for One-shot Text-driven Video Editing and Beyond | May 26, 2023 | Text-to-Video EditingVideo Editing | CodeCode Available | 2 |
| Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning | May 23, 2023 | Image GenerationOptical Flow Estimation | CodeCode Available | 2 |
| A Dual-level Detection Method for Video Copy Detection | May 21, 2023 | Copy DetectionPartial Video Copy Detection | CodeCode Available | 1 |
| InstructVid2Vid: Controllable Video Editing with Natural Language Instructions | May 21, 2023 | AttributeImage Generation | —Unverified | 0 |
| Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts | May 15, 2023 | DenoisingVideo Editing | —Unverified | 0 |