SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text).
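
As a rough illustration of the task definition above, the sketch below ranks a handful of passages against a query using bag-of-words cosine similarity. It is a toy lexical baseline for illustration only, not the method of any paper listed here; the function names (`tokenize`, `cosine`, `retrieve`) and the sample passages are assumptions made for this example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, passages, top_k=1):
    """Return the top_k (score, passage) pairs, highest score first."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(p))), p) for p in passages]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Hypothetical sample data for the example.
passages = [
    "Paris is the capital of France.",
    "The mitochondria is the powerhouse of the cell.",
    "Python is a popular programming language.",
]
best = retrieve("What is the capital of France?", passages, top_k=1)
print(best[0][1])
```

Systems built for this task usually replace the word-count vectors with learned representations, but the overall shape (score every candidate against the query, then sort) stays the same.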

Papers

Showing 101-150 of 671 papers

Title | Status | Hype
CLIP2Video: Mastering Video-Text Retrieval via Image CLIP | Code | 1
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval | Code | 1
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval | Code | 1
CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision | Code | 1
LDMol: Text-to-Molecule Diffusion Model with Structurally Informative Latent Space | Code | 1
Hyperbolic Image-Text Representations | Code | 1
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone | Code | 1
Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration | Code | 1
A Comprehensive Review of the Video-to-Text Problem | Code | 1
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning | Code | 1
A Survey of Medical Vision-and-Language Applications and Their Techniques | Code | 1
HANet: Hierarchical Alignment Networks for Video-Text Retrieval | Code | 1
ComCLIP: Training-Free Compositional Image and Text Matching | Code | 1
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark | Code | 1
Bridging Language Gaps in Audio-Text Retrieval | Code | 1
Composing Object Relations and Attributes for Image-Text Matching | Code | 1
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval | Code | 1
Consensus-Aware Visual-Semantic Embedding for Image-Text Matching | Code | 1
Graph Optimal Transport for Cross-Domain Alignment | Code | 1
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model | Code | 1
I0T: Embedding Standardization Method Towards Zero Modality Gap | Code | 1
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model | Code | 1
Bridging Video-text Retrieval with Multiple Choice Questions | Code | 1
Audio Retrieval with Natural Language Queries: A Benchmark Study | Code | 1
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding | Code | 1
Contrastive Audio-Language Learning for Music | Code | 1
Global and Local Semantic Completion Learning for Vision-Language Pre-training | Code | 1
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition | Code | 1
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval | Code | 1
GLEN: Generative Retrieval via Lexical Index Learning | Code | 1
GOAL: Global-local Object Alignment Learning | Code | 1
Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory | Code | 1
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions | Code | 1
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search | Code | 1
Generative Multi-hop Retrieval | Code | 1
Image-text Retrieval via Preserving Main Semantics of Vision | Code | 1
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval | Code | 1
FlexiViT: One Model for All Patch Sizes | Code | 1
A Data-Centric Framework for Composable NLP Workflows | Code | 1
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping | Code | 1
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning | Code | 1
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation | Code | 1
FILIP: Fine-grained Interactive Language-Image Pre-Training | Code | 1
Fine-Tuning LLaMA for Multi-Stage Text Retrieval | Code | 1
DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases | Code | 1
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training | Code | 1
FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1
Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval | Code | 1
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network | Code | 1
Cross-Modal Retrieval with Partially Mismatched Pairs | Code | 1
Page 3 of 14

No leaderboard results yet.