| ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos | Nov 22, 2024 | Language-Based Temporal Localization, Language Modeling | Code Available | 1 |
| RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos | Dec 11, 2023 | Natural Language Moment Retrieval, Natural Language Queries | Code Available | 1 |
| Saliency-Guided DETR for Moment Retrieval and Highlight Detection | Oct 2, 2024 | Highlight Detection, Moment Retrieval | Code Available | 1 |
| BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos | Nov 30, 2023 | Moment Retrieval, Natural Language Moment Retrieval | Code Available | 1 |
| Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection | Nov 28, 2023 | Contrastive Learning, Highlight Detection | Code Available | 1 |
| DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos | May 22, 2025 | Natural Language Moment Retrieval, Natural Language Queries | Code Available | 1 |
| Dense Regression Network for Video Grounding | Apr 7, 2020 | Natural Language Moment Retrieval, Natural Language Queries | Code Available | 1 |
| FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding | Dec 18, 2024 | Highlight Detection, Moment Retrieval | Code Available | 1 |
| VLG-Net: Video-Language Graph Matching Network for Video Grounding | Nov 19, 2020 | Graph Matching, Moment Retrieval | Code Available | 1 |
| MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment | Nov 30, 2018 | Moment Retrieval, Natural Language Moment Retrieval | Unverified | 0 |