SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Title	Date	Tasks	Status
Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering	Nov 14, 2024	Medical Question AnsweringMisinformation	—Unverified
ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems	Jan 14, 2025	Question AnsweringRAG	—Unverified
Composing Open-domain Vision with RAG for Ocean Monitoring and Conservation	Dec 3, 2024	RAGRetrieval	—Unverified
Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework	May 27, 2025	DiagnosticKnowledge Graphs	—Unverified
ASTRAL: Automated Safety Testing of Large Language Models	Jan 28, 2025	RAGRetrieval-augmented Generation	—Unverified
Comparing the Utility, Preference, and Performance of Course Material Search Functionality and Retrieval-Augmented Generation Large Language Model (RAG-LLM) AI Chatbots in Information-Seeking Tasks	Oct 17, 2024	ChatbotLanguage Modeling	—Unverified
Comparative Analysis of Retrieval Systems in the Real World	May 3, 2024	Information RetrievalQuestion Answering	—Unverified
Assessing the Robustness of Retrieval-Augmented Generation Systems in K-12 Educational Question Answering with Knowledge Discrepancies	Dec 12, 2024	Question AnsweringRAG	—Unverified
AgentNet: Decentralized Evolutionary Coordination for LLM-based Multi-Agent Systems	Apr 1, 2025	Privacy PreservingRAG	—Unverified
BadJudge: Backdoor Vulnerabilities of LLM-as-a-Judge	Mar 1, 2025	EthicsModel Selection	—Unverified

Title

Status

Hype

Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering

—Unverified

ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems

—Unverified

Composing Open-domain Vision with RAG for Ocean Monitoring and Conservation

—Unverified

Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework

—Unverified

ASTRAL: Automated Safety Testing of Large Language Models