SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Title	Date	Tasks	Status	Hype
OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit	May 12, 2025	GPUPrivacy Preserving	CodeCode Available	4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation	Aug 8, 2024	ChunkingFact Checking	CodeCode Available	4
Benchmarking Retrieval-Augmented Generation for Medicine	Feb 20, 2024	BenchmarkingInformation Retrieval	CodeCode Available	4
Retrieval-Augmented Generation for Large Language Models: A Survey	Dec 18, 2023	HallucinationRAG	CodeCode Available	4
COS-Mix: Cosine Similarity and Distance Fusion for Improved Information Retrieval	Jun 2, 2024	Information RetrievalRAG	CodeCode Available	4
In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss	Feb 16, 2024	RAG	CodeCode Available	4
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection	Oct 17, 2023	Fact VerificationQuestion Answering	CodeCode Available	4
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis	May 22, 2025	DiversityInformation Retrieval	CodeCode Available	4
DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments	Apr 4, 2025	NavigatePrompt Engineering	CodeCode Available	4
Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions	Aug 1, 2024	Medical Question AnsweringMedQA	CodeCode Available	4

Title

Status

Hype

OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit

CodeCode Available

Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation

CodeCode Available

Benchmarking Retrieval-Augmented Generation for Medicine

CodeCode Available

Retrieval-Augmented Generation for Large Language Models: A Survey

CodeCode Available

COS-Mix: Cosine Similarity and Distance Fusion for Improved Information Retrieval