SOTAVerified

Retrieval

A methodology that involves selecting relevant data or examples from a large dataset to support tasks like prediction, learning, or inference. It enhances models by providing context or additional information, often used in systems like retrieval-augmented generation or in-context learning.

Papers

Showing 40014050 of 14297 papers

TitleStatusHype
LitLLM: A Toolkit for Scientific Literature ReviewCode2
CoLe and LYS at BioASQ MESINESP8 Task: similarity based descriptor assignment in Spanish0
CorpusLM: Towards a Unified Language Model on Corpus for Knowledge-Intensive Tasks0
RimiRec: Modeling Refined Multi-interest in Hierarchical Structure for Recommendation0
Continual Learning for Large Language Models: A Survey0
The Queen of England is not England's Queen: On the Lack of Factual Coherency in PLMsCode0
PICS: Pipeline for Image Captioning and Search0
HiQA: A Hierarchical Contextual Augmentation RAG for Multi-Documents QA0
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM CollaborationCode1
An Exam-based Evaluation Approach Beyond Traditional Relevance Judgments0
Health-LLM: Personalized Retrieval-Augmented Disease Prediction System0
RAG-Fusion: a New Take on Retrieval-Augmented Generation0
M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text RetrievalCode2
RAPTOR: Recursive Abstractive Processing for Tree-Organized RetrievalCode5
GUMsley: Evaluating Entity Salience in Summarization for 12 English Genres0
Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels0
When Large Language Models Meet Vector Databases: A SurveyCode3
Retrieval Augmented Deep Anomaly Detection for Tabular DataCode0
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language ModelsCode3
LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and DistillationCode2
NNOSE: Nearest Neighbor Occupational Skill ExtractionCode0
Re3val: Reinforced and Reranked Generative Retrieval0
History-Aware Conversational Dense RetrievalCode1
NFT1000: A Cross-Modal Dataset for Non-Fungible Token RetrievalCode1
Development and Testing of Retrieval Augmented Generation in Large Language Models -- A Case Study Report0
Effective Controllable Bias Mitigation for Classification and Retrieval using Gate AdaptersCode0
Vocabulary-Defined Semantics: Latent Space Clustering for Improving In-Context Learning0
MT-HCCAR: Multi-Task Deep Learning with Hierarchical Classification and Attention-based Regression for Cloud Property RetrievalCode0
Corrective Retrieval Augmented GenerationCode3
KAUCUS: Knowledge Augmented User Simulators for Training Language Model Assistants0
Textual Entailment for Effective Triple Validation in Object PredictionCode0
LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs' Vulnerability Reasoning0
Cross-Modal Coordination Across a Diverse Set of Input Modalities0
Fine-Tuned Large Language Models for Symptom Recognition from Spanish Clinical Text0
PILOT: Legal Case Outcome Prediction with Case Law0
Enhancing Large Language Model Performance To Answer Questions and Extract Information More Accurately0
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop QueriesCode3
Music Auto-Tagging with Robust Music Representation Learned via Domain Adversarial Training0
Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval0
DataFrame QA: A Universal LLM Framework on DataFrame Question Answering Without Data Exposure0
Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language ModelsCode2
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance ProcessCode1
Expressivity-aware Music Performance Retrieval using Mid-level Perceptual Features and Emotion Word Embeddings0
The Power of Noise: Redefining Retrieval for RAG SystemsCode2
Recency Ranking by Diversification of Result Set0
K-QA: A Real-World Medical Q&A BenchmarkCode1
Ta'keed: The First Generative Fact-Checking System for Arabic Claims0
Accelerating Retrieval-Augmented Language Model Serving with Speculation0
Towards 3D Molecule-Text Interpretation in Language ModelsCode2
It's About Time: Incorporating Temporality in Retrieval Augmented Language Models0
Show:102550
← PrevPage 81 of 286Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second183.53Unverified
2ElasticsearchQueries per second21.8Unverified
3BM25-PTQueries per second6.49Unverified
4Rank-BM25Queries per second1.18Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second20.88Unverified
2ElasticsearchQueries per second7.11Unverified
3Rank-BM25Queries per second0.04Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second41.85Unverified
2ElasticsearchQueries per second12.16Unverified
3Rank-BM25Queries per second0.1Unverified
#ModelMetricClaimedVerifiedStatus
1FLMRRecall@589.32Unverified
2RA-VQARecall@582.84Unverified
#ModelMetricClaimedVerifiedStatus
1PreFLMRRecall@562.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP-KIStext-to-video Mean Rank30Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4OutfitRecall@57.59Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified