SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) for a given query, which may be a question, a set of keywords, or any other relevant text.
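As a rough illustration of the task definition above, the following minimal sketch ranks a handful of passages against a query using TF-IDF weights and cosine similarity. It is only one simple lexical baseline, not the method of any paper listed on this page; the function names `tokenize` and `rank_passages` and the sample passages are hypothetical, and the whitespace tokenizer is an assumption made to keep the example self-contained.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization is enough for a toy example.
    return text.lower().split()


def rank_passages(query, passages):
    """Return (passage_index, score) pairs sorted from most to least relevant."""
    docs = [Counter(tokenize(p)) for p in passages]
    n_docs = len(docs)

    # Document frequency: in how many passages each term appears.
    df = Counter(term for doc in docs for term in doc)
    # Smoothed inverse document frequency, so no weight is ever zero or negative.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}

    def weigh(counts):
        # Terms unseen in every passage carry no weight and are dropped.
        return {t: tf * idf[t] for t, tf in counts.items() if t in idf}

    def cosine(a, b):
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        norm_a = math.sqrt(sum(w * w for w in a.values()))
        norm_b = math.sqrt(sum(w * w for w in b.values()))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

    query_vec = weigh(Counter(tokenize(query)))
    scored = [(i, cosine(query_vec, weigh(doc))) for i, doc in enumerate(docs)]
    # Highest score first; ties keep the original passage order.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored


if __name__ == "__main__":
    passages = [
        "the cat sat on the mat",
        "dense retrieval encodes queries and passages as vectors",
        "stock prices fell sharply today",
    ]
    for index, score in rank_passages("how does dense retrieval encode passages", passages):
        print(f"{score:.3f}  {passages[index]}")
```

Dense retrievers, such as several of those in the list below, replace the TF-IDF vectors with learned embeddings but keep the same overall shape: encode the query, encode the candidates, and sort by similarity.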

Papers

Showing 76–100 of 671 papers

Title | Status | Hype
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval | Code | 1
mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs | Code | 1
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning | Code | 1
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset | Code | 1
AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning | Code | 1
Fine-Tuning LLaMA for Multi-Stage Text Retrieval | Code | 1
FlexiViT: One Model for All Patch Sizes | Code | 1
Dense Hierarchical Retrieval for Open-Domain Question Answering | Code | 1
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval | Code | 1
FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1
A Comprehensive Review of the Video-to-Text Problem | Code | 1
DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases | Code | 1
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval | Code | 1
Bridging Language Gaps in Audio-Text Retrieval | Code | 1
Data-Efficient Multimodal Fusion on a Single GPU | Code | 1
Bridging Video-text Retrieval with Multiple Choice Questions | Code | 1
Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data | Code | 1
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval | Code | 1
A Dense Representation Framework for Lexical and Semantic Matching | Code | 1
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training | Code | 1
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation | Code | 1
ArabicaQA: A Comprehensive Dataset for Arabic Question Answering | Code | 1
DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval | Code | 1
Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval | Code | 1
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval | Code | 1

No leaderboard results yet.