SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 151175 of 671 papers

TitleStatusHype
Cross-modal Scene Graph Matching for Relationship-aware Image-Text RetrievalCode1
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal MappingCode1
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based SearchCode1
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video RetrievalCode1
Helping Hands: An Object-Aware Ego-Centric Video Recognition ModelCode1
Global and Local Semantic Completion Learning for Vision-Language Pre-trainingCode1
GOAL: Global-local Object Alignment LearningCode1
Cross-Modal Retrieval with Partially Mismatched PairsCode1
Cross-Modal Retrieval for Motion and Text via DopTriple LossCode1
Hyperbolic Image-Text RepresentationsCode1
Fine-grained Video-Text Retrieval with Hierarchical Graph ReasoningCode1
FILIP: Fine-grained Interactive Language-Image Pre-TrainingCode1
Densifying Sparse Representations for Passage Retrieval by Representational SlicingCode1
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax LossCode1
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue DatasetCode1
Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial TrajectoryCode1
A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalCode1
Dynamic Modality Interaction Modeling for Image-Text RetrievalCode1
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning NetworkCode1
DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text RetrievalCode1
Large-Scale Adversarial Training for Vision-and-Language Representation LearningCode1
LDMol: Text-to-Molecule Diffusion Model with Structurally Informative Latent SpaceCode1
Bridging Video-text Retrieval with Multiple Choice QuestionsCode1
Fine-Tuning LLaMA for Multi-Stage Text RetrievalCode1
Cross-modal Contrastive Learning for Speech TranslationCode1
Show:102550
← PrevPage 7 of 27Next →

No leaderboard results yet.