SOTAVerified

Semantic Retrieval

Papers

Showing 150 of 86 papers

TitleStatusHype
SCBench: A KV Cache-Centric Analysis of Long-Context MethodsCode5
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information RetrievalCode2
GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook RetrievalCode2
Deep Unsupervised Image Hashing by Maximizing Bit EntropyCode1
Semantic Information Recovery in Wireless NetworksCode1
MINERS: Multilingual Language Models as Semantic RetrieversCode1
LLM Based Multi-Agent Generation of Semi-structured Documents from Semantic Templates in the Public Administration DomainCode1
Semantic Models for the First-stage Retrieval: A Comprehensive ReviewCode1
Revealing the Importance of Semantic Retrieval for Machine Reading at ScaleCode1
Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge DistillationCode1
Training Effective Neural Sentence Encoders from Automatically Mined ParaphrasesCode1
Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric LearningCode1
Sentence Representation Learning with Generative Objective rather than Contrastive ObjectiveCode1
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language ModelsCode1
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval0
Dynamic Boundary Time Warping for Sub-sequence Matching with Few Examples0
Engineering RAG Systems for Real-World Applications: Design, Development, and Evaluation0
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training0
Enhancing Relevance of Embedding-based Retrieval at Walmart0
Event-Centric Query Expansion in Web Search0
From Semantic Retrieval to Pairwise Ranking: Applying Deep Learning in E-commerce Search0
Generative or Contrastive? Phrase Reconstruction for Better Sentence Representation Learning0
GNN-Coder: Boosting Semantic Code Retrieval with Combined GNNs and Transformer0
Hashing on Nonlinear Manifolds0
Improving Cross-lingual Representation for Semantic Retrieval with Code-switching0
Improving Tool Retrieval by Leveraging Large Language Models for Query Generation0
Information Retrieval: Recent Advances and Beyond0
How to Make LLMs Strong Node Classifiers?0
Learning Spatially-Aware Language and Audio Embeddings0
LLM Agents as 6G Orchestrator: A Paradigm for Task-Oriented Physical-Layer Automation0
Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing0
MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query0
More Like This: Semantic Retrieval with Linguistic Information0
Multilingual Universal Sentence Encoder for Semantic Retrieval0
Multi-task Sentence Encoding Model for Semantic Retrieval in Question Answering Systems0
NIST: An Image Classification Network to Image Semantic Retrieval0
Optimizing Retrieval Augmented Generation for Object Constraint Language0
Pattern retrieval of traffic congestion using graph-based associations of traffic domain-specific features0
Permutation Invariant Strategy Using Transformer Encoders for Table Understanding0
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System0
AmazonQAC: A Large-Scale, Naturalistic Query Autocomplete Dataset0
An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education0
Athena: Retrieval-augmented Legal Judgment Prediction with Large Language Models0
Beyond Instance-Level Image Retrieval: Leveraging Captions to Learn a Global Visual Representation for Semantic Retrieval0
Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine0
Beyond Lexical: A Semantic Retrieval Framework for Textual Search Engine0
Compressing Sentence Representation via Homomorphic Projective Distillation0
Compressing Sentence Representation with maximum Coding Rate Reduction0
Cross-Document Cross-Lingual NLI via RST-Enhanced Graph Fusion and Interpretability Prediction0
CSDR-BERT: a pre-trained scientific dataset match model for Chinese Scientific Dataset Retrieval0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Human baselineSoft-F10.84Unverified
2k-NN with sentence n-grams, GPT-2 embeddings, fICASoft-F10.51Unverified
3DBTW, GPT-1 embeddings, fICASoft-F10.51Unverified
4LSA baselineSoft-F10.39Unverified
5Universal Sentence EncoderSoft-F10.38Unverified
6Sentence BERTSoft-F10.31Unverified