SOTAVerified

Natural Language Moment Retrieval

Papers

Showing 122 of 22 papers

TitleStatusHype
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment RetrievalCode2
UniVTG: Towards Unified Video-Language Temporal GroundingCode2
UniMD: Towards Unifying Moment Retrieval and Temporal Action DetectionCode2
Correlation-Guided Query-Dependency Calibration for Video Temporal GroundingCode2
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment RetrievalCode2
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight DetectionCode1
Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed VideosCode1
Localizing Moments in Long Video Via Multimodal GuidanceCode1
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio DescriptionsCode1
Background-aware Moment Detection for Video Moment RetrievalCode1
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long VideosCode1
RGNet: A Unified Clip Retrieval and Grounding Network for Long VideosCode1
Saliency-Guided DETR for Moment Retrieval and Highlight DetectionCode1
BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in VideosCode1
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight DetectionCode1
DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long VideosCode1
Dense Regression Network for Video GroundingCode1
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal GroundingCode1
VLG-Net: Video-Language Graph Matching Network for Video GroundingCode1
LLaVA-MR: Large Language-and-Vision Assistant for Video Moment RetrievalCode0
UnLoc: A Unified Framework for Video Localization TasksCode0
MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment0
Show:102550

No leaderboard results yet.