SOTAVerified

Semantic Retrieval

Papers

Showing 125 of 86 papers

TitleStatusHype
SCBench: A KV Cache-Centric Analysis of Long-Context MethodsCode5
GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook RetrievalCode2
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information RetrievalCode2
MINERS: Multilingual Language Models as Semantic RetrieversCode1
LLM Based Multi-Agent Generation of Semi-structured Documents from Semantic Templates in the Public Administration DomainCode1
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language ModelsCode1
Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge DistillationCode1
Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric LearningCode1
Sentence Representation Learning with Generative Objective rather than Contrastive ObjectiveCode1
Training Effective Neural Sentence Encoders from Automatically Mined ParaphrasesCode1
Semantic Information Recovery in Wireless NetworksCode1
Semantic Models for the First-stage Retrieval: A Comprehensive ReviewCode1
Deep Unsupervised Image Hashing by Maximizing Bit EntropyCode1
Revealing the Importance of Semantic Retrieval for Machine Reading at ScaleCode1
Semantic-enhanced Modality-asymmetric Retrieval for Online E-commerce Search0
Engineering RAG Systems for Real-World Applications: Design, Development, and Evaluation0
MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query0
Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis0
Deep Retrieval at CheckThat! 2025: Identifying Scientific Papers from Implicit Social Media Mentions via Hybrid Retrieval and Re-Ranking0
HDLxGraph: Bridging Large Language Models and HDL Repositories via HDL Graph DatabasesCode0
Optimizing Retrieval Augmented Generation for Object Constraint Language0
Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion0
An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education0
RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation0
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Human baselineSoft-F10.84Unverified
2k-NN with sentence n-grams, GPT-2 embeddings, fICASoft-F10.51Unverified
3DBTW, GPT-1 embeddings, fICASoft-F10.51Unverified
4LSA baselineSoft-F10.39Unverified
5Universal Sentence EncoderSoft-F10.38Unverified
6Sentence BERTSoft-F10.31Unverified