SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 18211830 of 2111 papers

TitleStatusHype
Evaluating Students' Open-ended Written Responses with LLMs: Using the RAG Framework for GPT-3.5, GPT-4, Claude-3, and Mistral-Large0
Evaluating the Effect of Retrieval Augmentation on Social Biases0
Evaluating the Impact of Advanced LLM Techniques on AI-Lecture Tutors for a Robotics Course0
Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain0
Evaluating the Retrieval Component in LLM-Based Question Answering Systems0
Evaluating Transferability in Retrieval Tasks: An Approach Using MMD and Kernel Methods0
Evaluation of Attribution Bias in Retrieval-Augmented Large Language Models0
Evaluation of RAG Metrics for Question Answering in the Telecom Domain0
Evaluation of Semantic Search and its Role in Retrieved-Augmented-Generation (RAG) for Arabic Language0
EventChat: Implementation and user-centric evaluation of a large language model-driven conversational recommender system for exploring leisure events in an SME context0
Show:102550
← PrevPage 183 of 212Next →

No leaderboard results yet.