Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) for a given query (which could be a question, keywords, or any relevant text).

Papers

Showing 101–125 of 671 papers

Title	Date	Tasks	Status	Hype
CodeXEmbed: A Generalist Embedding Model Family for Multilingual and Multi-task Code Retrieval	Nov 19, 2024	Diversity, Natural Language Queries	Unverified	0
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?	Nov 19, 2024	Retrieval, Text Retrieval	Unverified	0
Partial Scene Text Retrieval	Nov 15, 2024	Multiple Instance Learning, Retrieval	Code Available	0
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs	Nov 4, 2024	Cross-Modal Retrieval, Information Retrieval	Unverified	0
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities	Nov 4, 2024	Attribute, Descriptive	Unverified	0
Nearest Neighbor Normalization Improves Multimodal Retrieval	Oct 31, 2024	Cross-Modal Retrieval, Image Captioning	Code Available	1
Multilingual Vision-Language Pre-training for the Remote Sensing Domain	Oct 30, 2024	Cross-Modal Retrieval, image-classification	Code Available	0
Robotic State Recognition with Image-to-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization	Oct 30, 2024	Image to text, Image-to-Text Retrieval	Unverified	0
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications	Oct 29, 2024	Image Retrieval, RAG	Code Available	2
Do Audio-Language Models Understand Linguistic Variations?	Oct 21, 2024	Contrastive Learning, Natural Language Queries	Unverified	0
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning	Oct 20, 2024	Image Retrieval, Image-text Retrieval	Code Available	0
Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging	Oct 19, 2024	model, Semantic Textual Similarity	Unverified	0
Beyond Coarse-Grained Matching in Video-Text Retrieval	Oct 16, 2024	Retrieval, Text Retrieval	Unverified	0
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning	Oct 15, 2024	Image-text Retrieval, Text Retrieval	Unverified	0
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning	Oct 9, 2024	Large Language Model, Motion Captioning	Unverified	0
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval	Oct 9, 2024	Retrieval, Text Retrieval	Code Available	1
AnyAttack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models	Oct 7, 2024	Image Captioning, Image-text Retrieval	Unverified	0
CoLLAP: Contrastive Long-form Language-Audio Pretraining with Musical Temporal Structure Augmentation	Oct 3, 2024	Contrastive Learning, Form	Unverified	0
From Unimodal to Multimodal: Scaling up Projectors to Align Modalities	Sep 28, 2024	Image-text Retrieval, Semantic Similarity	Code Available	0
Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization	Sep 26, 2024	Image to text, Image-to-Text Retrieval	Unverified	0
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval	Sep 16, 2024	AudioCaps, Retrieval	Unverified	0
NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training	Sep 15, 2024	Contrastive Learning, cross-modal alignment	Unverified	0
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds	Sep 13, 2024	Audio Classification, Descriptive	Code Available	1
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG	Sep 12, 2024	Benchmarking, Question Answering	Unverified	0
Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations	Sep 11, 2024	Image-text Retrieval, Text Retrieval	Unverified	0

No leaderboard results yet.