| A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference | Jun 26, 2023 | Video Alignment | CodeCode Available | 0 |
| Learning from Video and Text via Large-Scale Discriminative Clustering | Jul 27, 2017 | Action RecognitionClustering | CodeCode Available | 0 |
| Deep Understanding of Sign Language for Sign to Subtitle Alignment | Mar 5, 2025 | TranslationVideo Alignment | CodeCode Available | 0 |
| Adversarial Skill Networks: Unsupervised Robot Skill Learning from Video | Oct 21, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Sound Bridge: Associating Egocentric and Exocentric Videos via Audio Cues | Jan 1, 2025 | Action RecognitionScene Recognition | CodeCode Available | 0 |
| LAMV: Learning to Align and Match Videos With Kernelized Temporal Layers | Jun 1, 2018 | Copy DetectionRetrieval | CodeCode Available | 0 |
| Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified Model | Jul 31, 2024 | BenchmarkingLarge Language Model | CodeCode Available | 0 |
| Edit As You Wish: Video Caption Editing with Multi-grained User Control | May 15, 2023 | AttributePosition | CodeCode Available | 0 |