SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 441450 of 2111 papers

TitleStatusHype
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection AttacksCode1
RNNs are not Transformers (Yet): The Key Bottleneck on In-context RetrievalCode1
WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World ScenarioCode1
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question AnsweringCode1
Evaluating Very Long-Term Conversational Memory of LLM AgentsCode1
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation SystemsCode1
What Evidence Do Language Models Find Convincing?Code1
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and AdaptationCode1
C-RAG: Certified Generation Risks for Retrieval-Augmented Language ModelsCode1
How well do LLMs cite relevant medical references? An evaluation framework and analysesCode1
Show:102550
← PrevPage 45 of 212Next →

No leaderboard results yet.