SOTAVerified

Retrieval

A methodology that involves selecting relevant data or examples from a large dataset to support tasks like prediction, learning, or inference. It enhances models by providing context or additional information, often used in systems like retrieval-augmented generation or in-context learning.

Papers

Showing 151200 of 14297 papers

TitleStatusHype
Graph Retrieval-Augmented Generation: A SurveyCode3
OpenResearcher: Unleashing AI for Accelerated Scientific ResearchCode3
BMX: Entropy-weighted Similarity and Semantic-enhanced Lexical SearchCode3
RAGEval: Scenario Specific RAG Evaluation Dataset Generation FrameworkCode3
TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSONCode3
Human-like Episodic Memory for Infinite Context LLMsCode3
Scaling Retrieval-Based Language Models with a Trillion-Token DatastoreCode3
Robust Neural Information Retrieval: An Adversarial and Out-of-distribution PerspectiveCode3
Retrieval-augmented generation in multilingual settingsCode3
BERGEN: A Benchmarking Library for Retrieval-Augmented GenerationCode3
Searching for Best Practices in Retrieval-Augmented GenerationCode3
Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?Code3
AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive ReasoningCode3
AutoSurvey: Large Language Models Can Automatically Write SurveysCode3
Multi-Head RAG: Solving Multi-Aspect Problems with LLMsCode3
GNN-RAG: Graph Neural Retrieval for Large Language Model ReasoningCode3
CHESS: Contextual Harnessing for Efficient SQL SynthesisCode3
GRAG: Graph Retrieval-Augmented GenerationCode3
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click LabelsCode3
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language ProcessingCode3
Retrieval Head Mechanistically Explains Long-Context FactualityCode3
From Matching to Generation: A Survey on Generative Information RetrievalCode3
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge BasesCode3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language ModelsCode3
MagicLens: Self-Supervised Image Retrieval with Open-Ended InstructionsCode3
Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed TomographyCode3
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question ComplexityCode3
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain FrameworkCode3
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon GenerationCode3
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of contextCode3
Bridging Language and Items for Retrieval and RecommendationCode3
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal RetrieversCode3
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language ModelsCode3
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning TasksCode3
When Large Language Models Meet Vector Databases: A SurveyCode3
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language ModelsCode3
Corrective Retrieval Augmented GenerationCode3
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop QueriesCode3
INTERS: Unlocking the Power of Large Language Models in Search with Instruction TuningCode3
FinanceBench: A New Benchmark for Financial Question AnsweringCode3
Large Language Model based Long-tail Query Rewriting in Taobao SearchCode3
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human PreferencesCode3
Landmark Attention: Random-Access Infinite Context Length for TransformersCode3
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on WikipediaCode3
ONE-PEACE: Exploring One General Representation Model Toward Unlimited ModalitiesCode3
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content CreationCode3
Universal Instance Perception as Object Discovery and RetrievalCode3
REPLUG: Retrieval-Augmented Black-Box Language ModelsCode3
Vision-Language Pre-training: Basics, Recent Advances, and Future TrendsCode3
A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation ModelsCode3
Show:102550
← PrevPage 4 of 286Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second183.53Unverified
2ElasticsearchQueries per second21.8Unverified
3BM25-PTQueries per second6.49Unverified
4Rank-BM25Queries per second1.18Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second20.88Unverified
2ElasticsearchQueries per second7.11Unverified
3Rank-BM25Queries per second0.04Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second41.85Unverified
2ElasticsearchQueries per second12.16Unverified
3Rank-BM25Queries per second0.1Unverified
#ModelMetricClaimedVerifiedStatus
1FLMRRecall@589.32Unverified
2RA-VQARecall@582.84Unverified
#ModelMetricClaimedVerifiedStatus
1PreFLMRRecall@562.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP-KIStext-to-video Mean Rank30Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4OutfitRecall@57.59Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified