SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 451460 of 2111 papers

TitleStatusHype
PeerQA: A Scientific Question Answering Dataset from Peer ReviewsCode1
NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for RetrievalCode1
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented GenerationCode1
NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question AnsweringCode1
Not All Contexts Are Equal: Teaching LLMs Credibility-aware GenerationCode1
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection AttacksCode1
Neuro-Symbolic Query CompilerCode1
MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAGCode1
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question AnsweringCode1
C-RAG: Certified Generation Risks for Retrieval-Augmented Language ModelsCode1
Show:102550
← PrevPage 46 of 212Next →

No leaderboard results yet.