| R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding | Mar 31, 2024 | Highlight Detection, Moment Retrieval | Code Available | 0 |
| InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Mar 22, 2024 | Action Classification, Action Recognition | Code Available | 7 |
| Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding | Mar 14, 2024 | Mamba, Moment Retrieval | Code Available | 3 |
| GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features | Mar 3, 2024 | Decoder, Highlight Detection | Unverified | 0 |
| Improving Video Corpus Moment Retrieval with Partial Relevance Enhancement | Feb 21, 2024 | Moment Retrieval, Retrieval | Code Available | 0 |
| Event-aware Video Corpus Moment Retrieval | Feb 21, 2024 | Contrastive Learning, Moment Retrieval | Unverified | 0 |
| Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval | Jan 24, 2024 | Moment Retrieval, Retrieval | Unverified | 0 |
| TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection | Jan 4, 2024 | Highlight Detection, Moment Retrieval | Code Available | 2 |
| Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval | Dec 19, 2023 | cross-modal alignment, Moment Retrieval | Code Available | 1 |
| Cross-modal Contrastive Learning with Asymmetric Co-attention Network for Video Moment Retrieval | Dec 12, 2023 | Contrastive Learning, Moment Retrieval | Code Available | 0 |
| Leveraging Generative Language Models for Weakly Supervised Sentence Component Analysis in Video-Language Joint Learning | Dec 10, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos | Nov 30, 2023 | Moment Retrieval, Natural Language Moment Retrieval | Code Available | 1 |
| Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection | Nov 28, 2023 | Contrastive Learning, Highlight Detection | Code Available | 1 |
| Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding | Nov 15, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 |
| SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval | Oct 8, 2023 | Moment Retrieval, Retrieval | Unverified | 0 |
| Language-Conditioned Change-point Detection to Identify Sub-Tasks in Robotics Domains | Sep 1, 2023 | Change Point Detection, Instruction Following | Code Available | 0 |
| DiffusionVMR: Diffusion Model for Joint Video Moment Retrieval and Highlight Detection | Aug 29, 2023 | Denoising, Highlight Detection | Unverified | 0 |
| UnLoc: A Unified Framework for Video Localization Tasks | Aug 21, 2023 | Action Segmentation, Moment Retrieval | Code Available | 0 |
| MVMR: A New Framework for Evaluating Faithfulness of Video Moment Retrieval against Multiple Distractors | Aug 15, 2023 | Contrastive Learning, Misinformation | Code Available | 0 |
| UniVTG: Towards Unified Video-Language Temporal Grounding | Jul 31, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 |
| MomentDiff: Generative Video Moment Retrieval from Random to Real | Jul 6, 2023 | Moment Retrieval, Retrieval | Code Available | 1 |
| A Survey on Video Moment Localization | Jun 13, 2023 | Action Localization, Moment Retrieval | Unverified | 0 |
| Background-aware Moment Detection for Video Moment Retrieval | Jun 5, 2023 | Moment Retrieval, Natural Language Moment Retrieval | Code Available | 1 |
| Faster Video Moment Retrieval with Point-Level Supervision | May 23, 2023 | Moment Retrieval, Natural Language Queries | Unverified | 0 |
| Joint Moment Retrieval and Highlight Detection Via Natural Language Queries | May 8, 2023 | Decoder, Highlight Detection | Code Available | 1 |