| RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos | Dec 11, 2023 | Natural Language Moment Retrieval, Natural Language Queries | Code Available | 1 |
| BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos | Nov 30, 2023 | Moment Retrieval, Natural Language Moment Retrieval | Code Available | 1 |
| Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection | Nov 28, 2023 | Contrastive Learning, Highlight Detection | Code Available | 1 |
| Background-aware Moment Detection for Video Moment Retrieval | Jun 5, 2023 | Moment Retrieval, Natural Language Moment Retrieval | Code Available | 1 |
| Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos | Mar 11, 2023 | Dense Video Captioning, Natural Language Moment Retrieval | Code Available | 1 |
| Localizing Moments in Long Video Via Multimodal Guidance | Feb 26, 2023 | Natural Language Moment Retrieval, Natural Language Visual Grounding | Code Available | 1 |
| MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions | Dec 1, 2021 | Moment Retrieval, Natural Language Moment Retrieval | Code Available | 1 |
| VLG-Net: Video-Language Graph Matching Network for Video Grounding | Nov 19, 2020 | Graph Matching, Moment Retrieval | Code Available | 1 |
| Dense Regression Network for Video Grounding | Apr 7, 2020 | Natural Language Moment Retrieval, Natural Language Queries | Code Available | 1 |
| LLaVA-MR: Large Language-and-Vision Assistant for Video Moment Retrieval | Nov 21, 2024 | Moment Retrieval, Natural Language Moment Retrieval | Code Available | 0 |