SOTAVerified

Natural Language Moment Retrieval

Papers

Showing 1120 of 22 papers

TitleStatusHype
RGNet: A Unified Clip Retrieval and Grounding Network for Long VideosCode1
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in VideosCode1
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight DetectionCode1
Background-aware Moment Detection for Video Moment RetrievalCode1
Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed VideosCode1
Localizing Moments in Long Video Via Multimodal GuidanceCode1
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsCode1
VLG-Net: Video-Language Graph Matching Network for Video GroundingCode1
Dense Regression Network for Video GroundingCode1
LLaVA-MR: Large Language-and-Vision Assistant for Video Moment RetrievalCode0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.