SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Title	Date	Tasks	Status	Hype	Score
PeerQA: A Scientific Question Answering Dataset from Peer Reviews	Feb 19, 2025	answerability predictionAnswer Generation	CodeCode Available	1	5
NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval	Sep 4, 2024	Image RetrievalRAG	CodeCode Available	1	5
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation	Dec 18, 2023	HallucinationLanguage Modelling	CodeCode Available	1	5
NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering	May 26, 2025	ChunkingLarge Language Model	CodeCode Available	1	5
Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation	Apr 10, 2024	AllRAG	CodeCode Available	1	5
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks	Mar 6, 2024	RAGRetrieval	CodeCode Available	1	5
Neuro-Symbolic Query Compiler	May 17, 2025	RAGResponse Generation	CodeCode Available	1	5
MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG	May 10, 2025	RAGRetrieval	CodeCode Available	1	5
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering	Sep 29, 2024	Graph Question AnsweringQuestion Answering	CodeCode Available	1	5
C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models	Feb 5, 2024	RAGRetrieval	CodeCode Available	1	5

Title

Status

Hype

PeerQA: A Scientific Question Answering Dataset from Peer Reviews

CodeCode Available

NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval

CodeCode Available

"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation

CodeCode Available

NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering

CodeCode Available

Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation