SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 351400 of 671 papers

TitleStatusHype
Towards a text-based quantitative and explainable histopathology image analysisCode0
CosmoCLIP: Generalizing Large Vision-Language Models for Astronomical Imaging0
EA-VTR: Event-Aware Video-Text Retrieval0
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding0
Neurocache: Efficient Vector Retrieval for Long-range Language ModelingCode0
Memory^3: Language Modeling with Explicit Memory0
PathAlign: A vision-language model for whole slide images in histopathology0
Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive LearningCode0
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling0
Evaluating D-MERIT of Partial-annotation on Information Retrieval0
Multi-Scale Temporal Difference Transformer for Video-Text Retrieval0
RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation0
Symmetric Multi-Similarity Loss for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2024Code0
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News RecommendationCode0
Unifying Multimodal Retrieval via Document Screenshot Embedding0
BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image RetrievalCode0
Enhancing Knowledge Retrieval with In-Context Learning and Semantic Search through Generative AI0
Which Country Is This? Automatic Country Ranking of Street View PhotosCode0
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval0
Diving Deep into the Motion Representation of Video-Text ModelsCode0
A Bi-metric Framework for Fast Similarity SearchCode0
HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model0
Jina CLIP: Your CLIP Model Is Also Your Text Retriever0
Uncertainty-aware sign language video retrieval with probability distribution modeling0
Knowledge-grounded Adaptation Strategy for Vision-language Models: Building Unique Case-set for Screening Mammograms for Residents Training0
Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships0
Multilingual Diversity Improves Vision-Language Representations0
Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning0
Active Learning for Finely-Categorized Image-Text Retrieval by Selecting Hard Negative Unpaired Samples0
An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval0
Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval0
RETTA: Retrieval-Enhanced Test-Time Adaptation for Zero-Shot Video Captioning0
Explaining Text Similarity in Transformer ModelsCode0
ProCIS: A Benchmark for Proactive Retrieval in ConversationsCode0
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-TuningCode0
Exploiting Positional Bias for Query-Agnostic Generative Content in SearchCode0
Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation0
VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical AlterationsCode0
UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation0
MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction0
FecTek: Enhancing Term Weight in Lexicon-Based Retrieval with Feature Context and Term-level Knowledge0
TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model0
Learning with Noisy Correspondence0
HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models0
Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement0
Shallow Cross-Encoders for Low-Latency RetrievalCode0
Denoising Table-Text Retrieval for Open-Domain Question AnsweringCode0
Improving Retrieval for RAG based Question Answering Models on Financial Documents0
Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval0
Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction0
Show:102550
← PrevPage 8 of 14Next →

No leaderboard results yet.