SOTAVerified

Retrieval

Retrieval selects relevant data or examples from a large dataset to support tasks such as prediction, learning, or inference. It supplies a model with additional context at inference or training time and is commonly used in retrieval-augmented generation (RAG) and in-context learning systems.
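
As a rough illustration of the idea described above, the following minimal sketch ranks a toy corpus against a query using bag-of-words cosine similarity and places the top result into a prompt. The function names (`tokenize`, `cosine`, `retrieve`), the corpus, and the prompt layout are all hypothetical and chosen for illustration only; real systems typically use BM25 or learned dense encoders instead of raw term counts.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def cosine(a, b):
    """Cosine similarity between two term-count vectors stored as Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, corpus, k=2):
    """Return the k documents most similar to the query, best first."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(doc))), doc) for doc in corpus]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:k]]


# Toy corpus (hypothetical data, for illustration only).
corpus = [
    "BM25 is a ranking function used by search engines",
    "Bananas are rich in potassium",
    "Dense retrieval encodes queries and documents as vectors",
]

question = "how do search engines rank documents"
context = retrieve(question, corpus, k=1)

# Retrieved text is prepended to the question, as in retrieval-augmented generation.
prompt = "Context:\n" + "\n".join(context) + "\n\nQuestion: " + question
print(prompt)
```

The retrieved passages act as the "context or additional information" mentioned in the definition; swapping `cosine` for a different scoring function changes the retriever without touching the rest of the pipeline.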

Papers

Showing 401–450 of 14,297 papers

| Title | Status | Hype |
| --- | --- | --- |
| DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models | Code | 2 |
| RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems | Code | 2 |
| EarthLoc: Astronaut Photography Localization by Indexing Earth from Space | Code | 2 |
| RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback | Code | 2 |
| RepoHyper: Search-Expand-Refine on Semantic Graphs for Repository-Level Code Completion | Code | 2 |
| Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval | Code | 2 |
| Large Language Models are In-Context Molecule Learners | Code | 2 |
| Backtracing: Retrieving the Cause of the Query | Code | 2 |
| Interactive Continual Learning: Fast and Slow Thinking | Code | 2 |
| Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models | Code | 2 |
| Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation | Code | 2 |
| Retrieval is Accurate Generation | Code | 2 |
| Pretrained Visual Uncertainties | Code | 2 |
| RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering | Code | 2 |
| The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG) | Code | 2 |
| Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation | Code | 2 |
| ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents | Code | 2 |
| EVOR: Evolving Retrieval for Code Generation | Code | 2 |
| RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Code | 2 |
| Distillation Enhanced Generative Retrieval | Code | 2 |
| CyberMetric: A Benchmark Dataset based on Retrieval-Augmented Generation for Evaluating LLMs in Cybersecurity Knowledge | Code | 2 |
| Verif.ai: Towards an Open-Source Scientific Generative Question-Answering System with Referenced and Verifiable Answers | Code | 2 |
| BEBLID: Boosted efficient binary local image descriptor | Code | 2 |
| Retrieval-Augmented Score Distillation for Text-to-3D Generation | Code | 2 |
| LitLLM: A Toolkit for Scientific Literature Review | Code | 2 |
| M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval | Code | 2 |
| LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation | Code | 2 |
| Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models | Code | 2 |
| The Power of Noise: Redefining Retrieval for RAG Systems | Code | 2 |
| Towards 3D Molecule-Text Interpretation in Language Models | Code | 2 |
| Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis | Code | 2 |
| TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection | Code | 2 |
| Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition | Code | 2 |
| Language-only Training of Zero-shot Composed Image Retrieval | Code | 2 |
| D3still: Decoupled Differential Distillation for Asymmetric Image Retrieval | Code | 2 |
| RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models | Code | 2 |
| T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step | Code | 2 |
| Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy | Code | 2 |
| SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing | Code | 2 |
| BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics | Code | 2 |
| Exploring Radar Data Representations in Autonomous Driving: A Comprehensive Review | Code | 2 |
| RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze! | Code | 2 |
| ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems | Code | 2 |
| Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training | Code | 2 |
| REST: Retrieval-Based Speculative Decoding | Code | 2 |
| Learning to Filter Context for Retrieval-Augmented Generation | Code | 2 |
| A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions | Code | 2 |
| LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents | Code | 2 |
| A Foundation Model for Music Informatics | Code | 2 |
| DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning | Code | 2 |

Benchmark Results

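| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BM25S | Queries per second | 183.53 | | Unverified |
| 2 | Elasticsearch | Queries per second | 21.8 | | Unverified |
| 3 | BM25-PT | Queries per second | 6.49 | | Unverified |
| 4 | Rank-BM25 | Queries per second | 1.18 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BM25S | Queries per second | 20.88 | | Unverified |
| 2 | Elasticsearch | Queries per second | 7.11 | | Unverified |
| 3 | Rank-BM25 | Queries per second | 0.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BM25S | Queries per second | 41.85 | | Unverified |
| 2 | Elasticsearch | Queries per second | 12.16 | | Unverified |
| 3 | Rank-BM25 | Queries per second | 0.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | FLMR | Recall@5 | 89.32 | | Unverified |
| 2 | RA-VQA | Recall@5 | 82.84 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PreFLMR | Recall@5 | 62.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CLIP-KIS | text-to-video Mean Rank | 30 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CLIP4Outfit | Recall@5 | 7.59 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | | Unverified |
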
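| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | COLT | COMP@84.55 | | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | hello | 0L1,121,222 | | | Unverified |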