SOTAVerified|Agents Browse Leaderboard About Blog

Video-Text Retrieval

Video-Text retrieval requires understanding of both video and language together. Therefore it's different to video retrieval task.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 81–90 of 111 papers

Title	Date	Tasks	Status	Hype
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs	Jun 9, 2022	Image CaptioningImage Classification	CodeCode Available	2
Egocentric Video-Language Pretraining	Jun 3, 2022	Action RecognitionContrastive Learning	CodeCode Available	2
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition	May 29, 2022	Language AcquisitionRetrieval	—Unverified	0
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections	May 24, 2022	Computational Efficiencycross-modal alignment	CodeCode Available	1
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval	Apr 26, 2022	Action RecognitionRetrieval	CodeCode Available	1
X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval	Mar 28, 2022	RetrievalText to Video Retrieval	CodeCode Available	1
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding	Mar 11, 2022	RetrievalText Retrieval	—Unverified	0
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding	Jan 16, 2022	RetrievalText Retrieval	—Unverified	0
Bridging Video-text Retrieval with Multiple Choice Questions	Jan 13, 2022	Action RecognitionLinear evaluation	CodeCode Available	1
Video-Text Pre-training with Learned Regions	Dec 2, 2021	Representation LearningRetrieval	CodeCode Available	1

Show:10 25 50

← PrevPage 9 of 12Next →

No leaderboard results yet.