SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 241250 of 2111 papers

TitleStatusHype
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation SystemsCode2
Benchmarking Large Language Models in Retrieval-Augmented GenerationCode2
Huatuo-26M, a Large-scale Chinese Medical QA DatasetCode2
SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative RefinementCode1
Constructing and Evaluating Declarative RAG Pipelines in PyTerrierCode1
FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented GenerationCode1
DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMsCode1
SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from DesignCode1
Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation SystemsCode1
LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff TableCode1
Show:102550
← PrevPage 25 of 212Next →

No leaderboard results yet.