SOTAVerified

Retrieval

A methodology that involves selecting relevant data or examples from a large dataset to support tasks like prediction, learning, or inference. It enhances models by providing context or additional information, often used in systems like retrieval-augmented generation or in-context learning.

Papers

Showing 301350 of 14297 papers

TitleStatusHype
Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains0
SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph RetrievalCode0
Do RAG Systems Suffer From Positional Bias?0
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement LearningCode2
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective0
CRAFT: Training-Free Cascaded Retrieval for Tabular QA0
An Alternative to FLOPS Regularization to Effectively Productionize SPLADE-Doc0
HDLxGraph: Bridging Large Language Models and HDL Repositories via HDL Graph DatabasesCode0
Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval0
MIRB: Mathematical Information Retrieval BenchmarkCode0
The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval AugmentationCode1
ChartCards: A Chart-Metadata Generation Framework for Multi-Task Chart UnderstandingCode0
LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval0
Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization0
Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization0
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster ManagementCode1
Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models0
SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation0
Knowledge Graph Based Repository-Level Code Generation0
Cross-Domain Diffusion with Progressive Alignment for Efficient Adaptive Retrieval0
RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture UnderstandingCode0
Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency0
Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks0
LoVR: A Benchmark for Long Video Retrieval in Multimodal ContextsCode1
Beginning with You: Perceptual-Initialization Improves Vision-Language Representation and Alignment0
RA-Touch: Retrieval-Augmented Touch Understanding with Enriched Visual Data0
s3: You Don't Need That Much Data to Train a Search Agent via RLCode4
Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement LearningCode1
Rank-K: Test-Time Reasoning for Listwise RerankingCode0
Benchmarking the Myopic Trap: Positional Bias in Information RetrievalCode5
R2MED: A Benchmark for Reasoning-Driven Medical RetrievalCode1
VulCPE: Context-Aware Cybersecurity Vulnerability Retrieval and Management0
Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data0
Beyond Text: Unveiling Privacy Vulnerabilities in Multi-modal Retrieval-Augmented Generation0
NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search0
Divide by Question, Conquer by Agent: SPLIT-RAG with Question-Driven Graph Partitioning0
JIR-Arena: The First Benchmark Dataset for Just-in-time Information RecommendationCode0
LLM-Based Compact Reranking with Document Features for Scientific Retrieval0
Cross-Lingual Representation Alignment Through Contrastive Image-Caption TuningCode0
Towards A Generalist Code Embedding Model Based On Massive Data Synthesis0
GuRE:Generative Query REwriter for Legal Passage RetrievalCode1
LLM-based Query Expansion Fails for Unfamiliar and Ambiguous QueriesCode0
RAR: Setting Knowledge Tripwires for Retrieval Augmented Rejection0
Duluth at SemEval-2025 Task 7: TF-IDF with Optimized Vector Dimensions for Multilingual Fact-Checked Claim RetrievalCode0
Optimizing Retrieval Augmented Generation for Object Constraint Language0
Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning0
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive ProgrammingCode2
Accelerating Adaptive Retrieval Augmented Generation via Instruction-Driven Representation Reduction of Retrieval Overlaps0
GMM-Based Comprehensive Feature Extraction and Relative Distance Preservation For Few-Shot Cross-Modal Retrieval0
Know3-RAG: A Knowledge-aware RAG Framework with Adaptive Retrieval, Generation, and FilteringCode0
Show:102550
← PrevPage 7 of 286Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second183.53Unverified
2ElasticsearchQueries per second21.8Unverified
3BM25-PTQueries per second6.49Unverified
4Rank-BM25Queries per second1.18Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second20.88Unverified
2ElasticsearchQueries per second7.11Unverified
3Rank-BM25Queries per second0.04Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second41.85Unverified
2ElasticsearchQueries per second12.16Unverified
3Rank-BM25Queries per second0.1Unverified
#ModelMetricClaimedVerifiedStatus
1FLMRRecall@589.32Unverified
2RA-VQARecall@582.84Unverified
#ModelMetricClaimedVerifiedStatus
1PreFLMRRecall@562.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP-KIStext-to-video Mean Rank30Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4OutfitRecall@57.59Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified