SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) combines a retrieval-based model with a generation-based model. A retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, conditions on the retrieved text to produce a response. Grounding generation in retrieved evidence can improve the accuracy and coherence of the output, especially in tasks that require detailed knowledge or long-context handling.
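The retrieve-then-generate flow described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not a reference implementation: the retriever is a toy bag-of-words cosine ranker, the names `retrieve`, `build_prompt`, and `CORPUS` are hypothetical, and the generation step is left out because it depends on whichever language model is used. The prompt string is simply what would be passed to that model.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, corpus, k=2):
    """Return up to k passages ranked by similarity to the query (toy retriever)."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(doc))), doc) for doc in corpus]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in scored[:k] if score > 0]


def build_prompt(query, passages):
    """Assemble retrieved passages and the question into one prompt string."""
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


# Hypothetical three-passage corpus, for illustration only.
CORPUS = [
    "Paris is the capital of France.",
    "The Nile is a river in Africa.",
    "Mount Everest is the highest mountain on Earth.",
]

question = "What is the capital of France?"
passages = retrieve(question, CORPUS, k=1)
prompt = build_prompt(question, passages)  # this string would go to the generator
```

In practice the toy ranker would usually be replaced by a sparse or dense retriever over a much larger index; the overall shape (rank, select top-k, condition the generator on the result) stays the same.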

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization. The retrieval step lets the model access and incorporate external information, making it less reliant on memorized knowledge and better suited to generating responses based on up-to-date or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.
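Two of the answer-level metrics named above, exact match and F1, can be computed as follows. This is a sketch assuming the normalization commonly applied in SQuAD-style question answering evaluation (lowercasing, stripping punctuation and English articles, collapsing whitespace); the function names are illustrative, and individual benchmarks may define the metrics differently.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, gold):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))


def token_f1(prediction, gold):
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    if not pred_tokens or not gold_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == gold_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)
```

For example, a prediction of "Paris, France" against the gold answer "Paris" has precision 1/2 and recall 1, so its F1 is 2/3 while its exact match is 0.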

Title	Date	Tasks	Status	Hype
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks	Mar 6, 2024	RAG, Retrieval	Code Available	1
RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval	Feb 28, 2024	RAG, Retrieval	Code Available	1
WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario	Feb 28, 2024	Articles, RAG	Code Available	1
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering	Feb 27, 2024	Open-Domain Question Answering, Question Answering	Code Available	1
Evaluating Very Long-Term Conversational Memory of LLM Agents	Feb 27, 2024	Avg, Dialogue Generation	Code Available	1
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems	Feb 27, 2024	Instruction Following, RAG	Code Available	1
What Evidence Do Language Models Find Convincing?	Feb 19, 2024	counterfactual, Misinformation	Code Available	1
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation	Feb 12, 2024	Decision Making, Language Modeling	Code Available	1
C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models	Feb 5, 2024	RAG, Retrieval	Code Available	1
How well do LLMs cite relevant medical references? An evaluation framework and analyses	Feb 3, 2024	RAG, Retrieval-augmented Generation	Code Available	1
