Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 671 papers

Title	Date	Tasks	Status	Hype
CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval	Nov 19, 2024	DiversityNatural Language Queries	—Unverified	0
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?	Nov 19, 2024	RetrievalText Retrieval	—Unverified	0
Partial Scene Text Retrieval	Nov 15, 2024	Multiple Instance LearningRetrieval	CodeCode Available	0
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs	Nov 4, 2024	Cross-Modal RetrievalInformation Retrieval	—Unverified	0
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities	Nov 4, 2024	AttributeDescriptive	—Unverified	0
Nearest Neighbor Normalization Improves Multimodal Retrieval	Oct 31, 2024	Cross-Modal RetrievalImage Captioning	CodeCode Available	1
Multilingual Vision-Language Pre-training for the Remote Sensing Domain	Oct 30, 2024	Cross-Modal Retrievalimage-classification	CodeCode Available	0
Robotic State Recognition with Image-to-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization	Oct 30, 2024	Image to textImage-to-Text Retrieval	—Unverified	0
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications	Oct 29, 2024	Image RetrievalRAG	CodeCode Available	2
Do Audio-Language Models Understand Linguistic Variations?	Oct 21, 2024	Contrastive LearningNatural Language Queries	—Unverified	0
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning	Oct 20, 2024	Image RetrievalImage-text Retrieval	CodeCode Available	0
Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging	Oct 19, 2024	modelSemantic Textual Similarity	—Unverified	0
Beyond Coarse-Grained Matching in Video-Text Retrieval	Oct 16, 2024	RetrievalText Retrieval	—Unverified	0
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning	Oct 15, 2024	Image-text RetrievalText Retrieval	—Unverified	0
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval	Oct 9, 2024	RetrievalText Retrieval	CodeCode Available	1
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning	Oct 9, 2024	Large Language ModelMotion Captioning	—Unverified	0
AnyAttack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models	Oct 7, 2024	Image CaptioningImage-text Retrieval	—Unverified	0
CoLLAP: Contrastive Long-form Language-Audio Pretraining with Musical Temporal Structure Augmentation	Oct 3, 2024	Contrastive LearningForm	—Unverified	0
From Unimodal to Multimodal: Scaling up Projectors to Align Modalities	Sep 28, 2024	Image-text RetrievalSemantic Similarity	CodeCode Available	0
Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization	Sep 26, 2024	Image to textImage-to-Text Retrieval	—Unverified	0
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval	Sep 16, 2024	AudioCapsRetrieval	—Unverified	0
NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training	Sep 15, 2024	Contrastive Learningcross-modal alignment	—Unverified	0
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds	Sep 13, 2024	Audio ClassificationDescriptive	CodeCode Available	1
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG	Sep 12, 2024	BenchmarkingQuestion Answering	—Unverified	0
Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations	Sep 11, 2024	Image-text RetrievalText Retrieval	—Unverified	0
Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E5	Sep 9, 2024	BenchmarkingInformation Retrieval	—Unverified	0
MODOC: A Modular Interface for Flexible Interlinking of Text Retrieval and Text Generation Functions	Aug 26, 2024	Information RetrievalRetrieval	CodeCode Available	0
Mistral-SPLADE: LLMs for better Learned Sparse Retrieval	Aug 20, 2024	DecoderLanguage Modeling	CodeCode Available	0
Improving embedding with contrastive fine-tuning on small datasets with expert-augmented scores	Aug 19, 2024	RetrievalSemantic Textual Similarity	—Unverified	0
NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality	Aug 18, 2024	RetrievalText Retrieval	—Unverified	0
Mamba Retriever: Utilizing Mamba for Effective and Efficient Dense Retrieval	Aug 15, 2024	Information RetrievalMamba	—Unverified	0
Pairing Clustered Inverted Indexes with kNN Graphs for Fast Approximate Retrieval over Learned Sparse Representations	Aug 8, 2024	RetrievalText Retrieval	—Unverified	0
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark	Aug 5, 2024	Dense Video CaptioningDiversity	CodeCode Available	1
Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation	Aug 2, 2024	Image-text RetrievalRetrieval	—Unverified	0
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval	Aug 1, 2024	AttributeOptical Character Recognition	CodeCode Available	1
Learning Video Context as Interleaved Multimodal Sequences	Jul 31, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models	Jul 30, 2024	Image to textImage-to-Text Retrieval	CodeCode Available	0
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval	Jul 29, 2024	Contrastive LearningReranking	—Unverified	0
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysis	Jul 29, 2024	Image-text RetrievalModel Selection	CodeCode Available	0
Multi-label Cluster Discrimination for Visual Representation Learning	Jul 24, 2024	Contrastive LearningImage-text Retrieval	CodeCode Available	4
Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective	Jul 21, 2024	Image-text RetrievalInformation Retrieval	—Unverified	0
Multimodal Misinformation Detection using Large Vision-Language Models	Jul 19, 2024	Fact CheckingFact Verification	—Unverified	0
Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval	Jul 17, 2024	Image-text RetrievalObject	CodeCode Available	0
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval	Jul 16, 2024	Question AnsweringRetrieval	CodeCode Available	5
Video-Language Alignment via Spatio-Temporal Graph Transformer	Jul 16, 2024	Contrastive LearningQuestion Answering	CodeCode Available	1
EA-VTR: Event-Aware Video-Text Retrieval	Jul 10, 2024	Action RecognitionContrastive Learning	—Unverified	0
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?	Jul 10, 2024	Contrastive LearningImage-text Retrieval	—Unverified	0
Towards a text-based quantitative and explainable histopathology image analysis	Jul 10, 2024	image-classificationImage Classification	CodeCode Available	0
CosmoCLIP: Generalizing Large Vision-Language Models for Astronomical Imaging	Jul 10, 2024	Contrastive LearningImage-text Retrieval	—Unverified	0
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding	Jul 9, 2024	Contrastive LearningDomain Adaptation	—Unverified	0

Show:10 25 50

← PrevPage 3 of 14Next →

No leaderboard results yet.