SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 226250 of 671 papers

TitleStatusHype
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text RetrievalCode1
Graph Optimal Transport for Cross-Domain AlignmentCode1
Large-Scale Adversarial Training for Vision-and-Language Representation LearningCode1
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal TransformersCode1
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text RetrievalCode1
Fine-grained Video-Text Retrieval with Hierarchical Graph ReasoningCode1
Knowledge Guided Text Retrieval and Reading for Open Domain Question AnsweringCode1
Cross-modal Scene Graph Matching for Relationship-aware Image-Text RetrievalCode1
UNITER: UNiversal Image-TExt Representation LearningCode1
XQA: A Cross-lingual Open-domain Question Answering DatasetCode1
Polysemous Visual-Semantic Embedding for Cross-Modal RetrievalCode1
Learning a Text-Video Embedding from Incomplete and Heterogeneous DataCode1
Stacked Cross Attention for Image-Text MatchingCode1
Maximal Matching Matters: Preventing Representation Collapse for Robust Cross-Modal Retrieval0
Tree-Based Text Retrieval via Hierarchical Clustering in RAGFrameworks: Application on Taiwanese RegulationsCode0
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention RecyclingCode0
Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration0
Adding simple structure at inference improves Vision-Language CompositionalityCode0
Beyond Cropped Regions: New Benchmark and Corresponding Baseline for Chinese Scene Text Retrieval in Diverse Layouts0
Attacking Attention of Foundation Models Disrupts Downstream TasksCode0
ERU-KG: Efficient Reference-aligned Unsupervised Keyphrase GenerationCode0
MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval0
Distill CLIP (DCLIP): Enhancing Image-Text Retrieval via Cross-Modal Transformer Distillation0
EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models0
Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval0
Show:102550
← PrevPage 10 of 27Next →

No leaderboard results yet.