SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 601650 of 671 papers

TitleStatusHype
NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal EmbeddingsCode0
Embracing Language Inclusivity and Diversity in CLIP through Continual Language LearningCode0
Reproducibility, Replicability, and Insights into Visual Document Retrieval with Late InteractionCode0
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural SupervisionCode0
MultiWay-Adapater: Adapting large-scale multi-modal models for scalable image-text retrievalCode0
VL-Taboo: An Analysis of Attribute-based Zero-shot Capabilities of Vision-Language ModelsCode0
Multi-stage Pre-training over Simplified Multimodal Pre-training ModelsCode0
Retrieval Augmentation for Deep Neural NetworksCode0
Diving Deep into the Motion Representation of Video-Text ModelsCode0
Dissecting Deep Metric Learning Losses for Image-Text RetrievalCode0
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question AnsweringCode0
Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text RetrievalCode0
Differentiable Outlier Detection Enable Robust Deep Multimodal AnalysisCode0
Multilingual Vision-Language Pre-training for the Remote Sensing DomainCode0
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in IndonesianCode0
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention RecyclingCode0
AugTriever: Unsupervised Dense Retrieval and Domain Adaptation by Scalable Data AugmentationCode0
Exposing and Mitigating Spurious Correlations for Cross-Modal RetrievalCode0
Towards a text-based quantitative and explainable histopathology image analysisCode0
MODOC: A Modular Interface for Flexible Interlinking of Text Retrieval and Text Generation FunctionsCode0
Design of the topology for contrastive visual-textual alignmentCode0
Rudder: A Cross Lingual Video and Text Retrieval DatasetCode0
Modelling Stopping Criteria for Search Results using Poisson ProcessesCode0
Exploiting Positional Bias for Query-Agnostic Generative Content in SearchCode0
Mistral-SPLADE: LLMs for better Learned Sparse RetrievalCode0
An Unsupervised Cross-Modal Hashing Method Robust to Noisy Training Image-Text Correspondences in Remote SensingCode0
Towards Robust Text Retrieval with Progressive LearningCode0
MHSAN: Multi-Head Self-Attention Network for Visual Semantic EmbeddingCode0
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text RetrievalCode0
MeTA: A Unified Toolkit for Text Retrieval and AnalysisCode0
Explaining Text Similarity in Transformer ModelsCode0
Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text RetrievalCode0
Large Vision-Language Models for Knowledge-Grounded Data Annotation of MemesCode0
Expertized Caption Auto-Enhancement for Video-Text RetrievalCode0
A Binary Variational Autoencoder for HashingCode0
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training BenchmarkCode0
Semantic-Preserving Augmentation for Robust Image-Text RetrievalCode0
Adding simple structure at inference improves Vision-Language CompositionalityCode0
Variational Deep Semantic Hashing for Text DocumentsCode0
It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug ReportsCode0
Shallow Cross-Encoders for Low-Latency RetrievalCode0
Tree-Based Text Retrieval via Hierarchical Clustering in RAGFrameworks: Application on Taiwanese RegulationsCode0
A Bi-metric Framework for Fast Similarity SearchCode0
Intra-Modal Constraint Loss For Image-Text RetrievalCode0
Denoising Table-Text Retrieval for Open-Domain Question AnsweringCode0
DeepTileBars: Visualizing Term Distribution for Neural Information RetrievalCode0
Single Shot Scene Text RetrievalCode0
Single-Stream Multi-Level Alignment for Vision-Language PretrainingCode0
Video-Text Retrieval by Supervised Sparse Multi-Grained LearningCode0
Socratic Models: Composing Zero-Shot Multimodal Reasoning with LanguageCode0
Show:102550
← PrevPage 13 of 14Next →

No leaderboard results yet.