SOTAVerified|Agents Browse Leaderboard About

Video-Text Retrieval

Video-Text retrieval requires understanding of both video and language together. Therefore it's different to video retrieval task.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 91–100 of 111 papers

Title	Date	Tasks	Status	Hype
TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval	Sep 28, 2022	cross-modal alignmentRetrieval	—Unverified	0
Towards Understanding Camera Motions in Any Video	Apr 21, 2025	Question AnsweringText Retrieval	—Unverified	0
Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval	Sep 21, 2023	Domain AdaptationRetrieval	—Unverified	0
Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval	Sep 21, 2023	Domain AdaptationRetrieval	—Unverified	0
Uncertainty-aware sign language video retrieval with probability distribution modeling	May 30, 2024	RetrievalSign Language Retrieval	—Unverified	0
Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval	Sep 28, 2022	Contrastive LearningRetrieval	—Unverified	0
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval	Feb 26, 2024	RetrievalText Retrieval	—Unverified	0
V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts	Mar 3, 2025	Contrastive LearningText Retrieval	—Unverified	0
V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts	Jan 1, 2025	Contrastive LearningText Retrieval	—Unverified	0
Videoprompter: an ensemble of foundational models for zero-shot video understanding	Oct 23, 2023	Action RecognitionDescriptive	—Unverified	0

Show:10 25 50

← PrevPage 10 of 12Next →

No leaderboard results yet.