Temporal Sentence Grounding

Temporal sentence grounding (TSG) aims to locate a specific moment from an untrimmed video with a given natural language query. For this task, different levels of supervision are used. 1) Weak supervision: video-level action category set; 2) Semi-weak supervision: video-level action category set, and action annotations at several timestamps; 3) Full supervision: Action category and action interval annotations of all actions in untrimmed videos.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 43 papers

Title	Date	Tasks	Status	Hype
DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos	May 22, 2025	Natural Language Moment RetrievalNatural Language Queries	CodeCode Available	1
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining	May 10, 2025	Contrastive LearningSentence	—Unverified	0
Contrast-Unity for Partially-Supervised Temporal Sentence Grounding	Feb 18, 2025	Contrastive LearningDenoising	—Unverified	0
Diversified Augmentation with Domain Adaptation for Debiased Video Temporal Grounding	Jan 12, 2025	Data AugmentationDomain Adaptation	—Unverified	0
Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network	Dec 20, 2024	SentenceTemporal Sentence Grounding	—Unverified	0
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models	Oct 4, 2024	Dense Video CaptioningSentence	CodeCode Available	2
Transformer with Controlled Attention for Synchronous Motion Captioning	Sep 13, 2024	Action LocalizationAction Segmentation	CodeCode Available	0
Diversifying Query: Region-Guided Transformer for Temporal Sentence Grounding	May 31, 2024	AttributeMoment Queries	CodeCode Available	0
Video sentence grounding with temporally global textual knowledge	Apr 21, 2024	Contrastive LearningRetrieval	—Unverified	0
Bias-Conflict Sample Synthesis and Adversarial Removal Debias Strategy for Temporal Sentence Grounding in Video	Jan 15, 2024	SentenceTemporal Sentence Grounding	CodeCode Available	0

Show:10 25 50

← PrevPage 1 of 5Next →

All datasets Charades-STA Ego4D-Goalstep

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	DeCafNet	R1@0.7	47.55	—	Unverified
2	AdaFocus (Full, MViT-Charades-Pretrain-feature, MMN model)	R1@0.7	38.6	—	Unverified
3	AdaFocus (Full, I3D-Charades-Pretrain-feature, MMN model)	R1@0.7	35.6	—	Unverified
4	MMN (Full, MViT-K400-Pretrain-feature, evaluated by AdaFocus)	R1@0.7	32.2	—	Unverified
5	MMN (Full, I3D-K400-Pretrain-feature, evaluated by AdaFocus)	R1@0.7	29.8	—	Unverified
6	AdaFocus (Weak, MViT-Charades-Pretrain-feature, CPL model)	R1@0.7	23.2	—	Unverified
7	AdaFocus (Weak, I3D-Charades-Pretrain-feature, CPL model)	R1@0.7	22.4	—	Unverified
8	CPL (Weak, MViT-K400-Pretrain-feature, evaluated by AdaFocus)	R1@0.7	21.8	—	Unverified
9	AdaFocus (Semi-weak, MViT-Charades-Pretrain-feature, D3G model)	R1@0.7	21.8	—	Unverified
10	AdaFocus (Semi-weak, I3D-Charades-Pretrain-feature, D3G model)	R1@0.7	21.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DeCafNet-100%	R@1,IoU=0.3	23.2	—	Unverified
2	DeCafNet-50%	R@1,IoU=0.3	21.29	—	Unverified
3	VSLNet	R@1,IoU=0.3	11.7	—	Unverified