| FedVMR: A New Federated Learning method for Video Moment Retrieval | Oct 28, 2022 | Federated LearningMoment Retrieval | —Unverified | 0 | 0 |
| LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval | Sep 27, 2019 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |
| Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking | Apr 11, 2025 | Moment RetrievalQuestion Answering | —Unverified | 0 | 0 |
| Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training | Feb 28, 2023 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |
| Fast Video Moment Retrieval | Jan 1, 2021 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |
| Faster Video Moment Retrieval with Point-Level Supervision | May 23, 2023 | Moment RetrievalNatural Language Queries | —Unverified | 0 | 0 |
| 2DP-2MRC: 2-Dimensional Pointer-based Machine Reading Comprehension Method for Multimodal Moment Retrieval | Jun 10, 2024 | Boundary DetectionMachine Reading Comprehension | —Unverified | 0 | 0 |
| Event-aware Video Corpus Moment Retrieval | Feb 21, 2024 | Contrastive LearningMoment Retrieval | —Unverified | 0 | 0 |
| EA-VTR: Event-Aware Video-Text Retrieval | Jul 10, 2024 | Action RecognitionContrastive Learning | —Unverified | 0 | 0 |
| EAGLE: Egocentric AGgregated Language-video Engine | Sep 26, 2024 | Action RecognitionActivity Recognition | —Unverified | 0 | 0 |
| D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching | Aug 23, 2024 | Highlight DetectionMoment Retrieval | —Unverified | 0 | 0 |
| Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models | Jan 14, 2025 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |
| Disentangle and denoise: Tackling context misalignment for video moment retrieval | Aug 14, 2024 | DenoisingDisentanglement | —Unverified | 0 | 0 |
| DiffusionVMR: Diffusion Model for Joint Video Moment Retrieval and Highlight Detection | Aug 29, 2023 | DenoisingHighlight Detection | —Unverified | 0 | 0 |
| DeSPITE: Exploring Contrastive Deep Skeleton-Pointcloud-IMU-Text Embeddings for Advanced Point Cloud Human Activity Understanding | Jun 16, 2025 | Activity RecognitionHuman Activity Recognition | —Unverified | 0 | 0 |
| DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments | Dec 28, 2024 | Action LocalizationAction Recognition | —Unverified | 0 | 0 |
| Cross-Lingual Cross-Modal Consolidation for Effective Multilingual Video Corpus Moment Retrieval | Jul 1, 2022 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |
| Video Moment Retrieval via Natural Language Queries | Sep 4, 2020 | Moment RetrievalNatural Language Queries | —Unverified | 0 | 0 |
| Video Moment Retrieval with Text Query Considering Many-to-Many Correspondence Using Potentially Relevant Pair | Jun 25, 2021 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |
| Context-Enhanced Video Moment Retrieval with Large Language Models | May 21, 2024 | cross-modal alignmentLanguage Modeling | —Unverified | 0 | 0 |
| ViSeRet: A simple yet effective approach to moment retrieval via fine-grained video segmentation | Oct 11, 2021 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |
| Coarse to Fine: Video Retrieval before Moment Localization | Oct 14, 2021 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |
| Multi-scale 2D Representation Learning for weakly-supervised moment retrieval | Nov 4, 2021 | Moment RetrievalRepresentation Learning | —Unverified | 0 | 0 |
| Multi-sentence Video Grounding for Long Video Generation | Jul 18, 2024 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |
| Multi-video Moment Ranking with Multimodal Clue | Jan 29, 2023 | Moment RetrievalRetrieval | —Unverified | 0 | 0 |