SOTAVerified

Retrieval

A methodology that involves selecting relevant data or examples from a large dataset to support tasks like prediction, learning, or inference. It enhances models by providing context or additional information, often used in systems like retrieval-augmented generation or in-context learning.

Papers

Showing 101150 of 14297 papers

TitleStatusHype
Generative Representational Instruction TuningCode4
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt InjectionCode4
RETSim: Resilient and Efficient Text SimilarityCode4
Retrieval-Augmented Generation with Hierarchical KnowledgeCode4
s3: You Don't Need That Much Data to Train a Search Agent via RLCode4
Retrieval-Augmented Generation for Knowledge-Intensive NLP TasksCode4
Gated Delta Networks: Improving Mamba2 with Delta RuleCode4
OnPrem.LLM: A Privacy-Conscious Document Intelligence ToolkitCode4
Retrieval-Augmented Generation for Large Language Models: A SurveyCode4
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language ModelingCode4
A Survey of LLM DATACode4
EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network OperationsCode4
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image RetrievalCode4
Resources for Brewing BEIR: Reproducible Reference Models and an Official LeaderboardCode4
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming HeadsCode4
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon GenerationCode3
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information RetrievalCode3
REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real WebsitesCode3
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented GenerationCode3
RAGEval: Scenario Specific RAG Evaluation Dataset Generation FrameworkCode3
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation ModelsCode3
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language ProcessingCode3
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language ModelsCode3
RAKG:Document-level Retrieval Augmented Knowledge Graph ConstructionCode3
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal RetrieversCode3
Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language ModelsCode3
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language ModelsCode3
Corrective Retrieval Augmented GenerationCode3
OpenResearcher: Unleashing AI for Accelerated Scientific ResearchCode3
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop QueriesCode3
Multi-Head RAG: Solving Multi-Aspect Problems with LLMsCode3
ONE-PEACE: Exploring One General Representation Model Toward Unlimited ModalitiesCode3
Parametric Retrieval Augmented GenerationCode3
MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation SystemCode3
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain FrameworkCode3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language ModelsCode3
MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare CopilotCode3
MegaPairs: Massive Data Synthesis For Universal Multimodal RetrievalCode3
MDocAgent: A Multi-Modal Multi-Agent Framework for Document UnderstandingCode3
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click LabelsCode3
PathRAG: Pruning Graph-based Retrieval Augmented Generation with Relational PathsCode3
REPLUG: Retrieval-Augmented Black-Box Language ModelsCode3
Ai2 Scholar QA: Organized Literature Synthesis with AttributionCode3
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding ModelsCode3
LRP4RAG: Detecting Hallucinations in Retrieval-Augmented Generation via Layer-wise Relevance PropagationCode3
A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation ModelsCode3
CHESS: Contextual Harnessing for Efficient SQL SynthesisCode3
Large Language Model based Long-tail Query Rewriting in Taobao SearchCode3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language ModelsCode3
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning TasksCode3
Show:102550
← PrevPage 3 of 286Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second183.53Unverified
2ElasticsearchQueries per second21.8Unverified
3BM25-PTQueries per second6.49Unverified
4Rank-BM25Queries per second1.18Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second20.88Unverified
2ElasticsearchQueries per second7.11Unverified
3Rank-BM25Queries per second0.04Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second41.85Unverified
2ElasticsearchQueries per second12.16Unverified
3Rank-BM25Queries per second0.1Unverified
#ModelMetricClaimedVerifiedStatus
1FLMRRecall@589.32Unverified
2RA-VQARecall@582.84Unverified
#ModelMetricClaimedVerifiedStatus
1PreFLMRRecall@562.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP-KIStext-to-video Mean Rank30Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4OutfitRecall@57.59Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified