SOTAVerified|Agents Browse Leaderboard About Blog

Video-Text Retrieval

Video-Text retrieval requires understanding of both video and language together. Therefore it's different to video retrieval task.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–60 of 111 papers

Title	Date	Tasks	Status	Hype
Global and Local Semantic Completion Learning for Vision-Language Pre-training	Jun 12, 2023	cross-modal alignmentImage-text Retrieval	CodeCode Available	1
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding	Jun 5, 2023	Language ModelingLanguage Modelling	CodeCode Available	4
VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending	May 22, 2023	Question AnsweringRetrieval	—Unverified	0
Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval	May 13, 2023	RetrievalText Retrieval	—Unverified	0
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception	May 10, 2023	Classificationimage-classification	—Unverified	0
SViTT: Temporal Learning of Sparse Video-Text Transformers	Apr 18, 2023	Question AnsweringRetrieval	CodeCode Available	1
CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning	Mar 22, 2023	Contrastive LearningRetrieval	—Unverified	0
Deep Learning for Video-Text Retrieval: a Review	Feb 24, 2023	Deep LearningRetrieval	—Unverified	0
Cross-Modal Retrieval with Partially Mismatched Pairs	Feb 22, 2023	Contrastive LearningCross-Modal Retrieval	CodeCode Available	1
Video-Text Retrieval by Supervised Sparse Multi-Grained Learning	Feb 19, 2023	Representation LearningRetrieval	CodeCode Available	0

Show:10 25 50

← PrevPage 6 of 12Next →

No leaderboard results yet.