| FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF | Jan 5, 2024 | NeRF, Video Editing | Code Available | 1 | 5 |
| InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction | Mar 26, 2025 | Instruction Following, Video Editing | Code Available | 1 | 5 |
| A Light Weight Model for Active Speaker Detection | Mar 8, 2023 | Active Speaker Detection, Audio-Visual Active Speaker Detection | Code Available | 1 | 5 |
| Feature Combination Meets Attention: Baidu Soccer Embeddings and Transformer based Temporal Detection | Jun 28, 2021 | Action Recognition, Action Spotting | Code Available | 1 | 5 |
| Consistent Video-to-Video Transfer Using Synthetic Dataset | Nov 1, 2023 | Video Editing | Code Available | 1 | 5 |
| 1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation | Jun 7, 2023 | Autonomous Driving, Panoptic Segmentation | Code Available | 1 | 5 |
| FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models | Jun 1, 2023 | 3D Face Reconstruction, Face Reconstruction | Code Available | 1 | 5 |
| Layered Neural Atlases for Consistent Video Editing | Sep 23, 2021 | Style Transfer, Video Editing | Code Available | 1 | 5 |
| MovieCuts: A New Dataset and Benchmark for Cut Type Recognition | Sep 12, 2021 | Video Editing, Vocal Bursts Type Prediction | Code Available | 1 | 5 |
| Patch-based Object-centric Transformers for Efficient Video Generation | Jun 8, 2022 | Object, Video Editing | Code Available | 1 | 5 |