SOTAVerified

Natural Language Moment Retrieval

Papers

Showing 122 of 22 papers

TitleStatusHype
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment RetrievalCode2
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment RetrievalCode2
UniMD: Towards Unifying Moment Retrieval and Temporal Action DetectionCode2
Correlation-Guided Query-Dependency Calibration for Video Temporal GroundingCode2
UniVTG: Towards Unified Video-Language Temporal GroundingCode2
DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long VideosCode1
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight DetectionCode1
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal GroundingCode1
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long VideosCode1
Saliency-Guided DETR for Moment Retrieval and Highlight DetectionCode1
RGNet: A Unified Clip Retrieval and Grounding Network for Long VideosCode1
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in VideosCode1
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight DetectionCode1
Background-aware Moment Detection for Video Moment RetrievalCode1
Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed VideosCode1
Localizing Moments in Long Video Via Multimodal GuidanceCode1
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsCode1
VLG-Net: Video-Language Graph Matching Network for Video GroundingCode1
Dense Regression Network for Video GroundingCode1
LLaVA-MR: Large Language-and-Vision Assistant for Video Moment RetrievalCode0
UnLoc: A Unified Framework for Video Localization TasksCode0
MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment0
Show:102550

No leaderboard results yet.