SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 711720 of 2111 papers

TitleStatusHype
Towards an automated workflow in materials science for combining multi-modal simulative and experimental information using data mining and large language models0
MomentSeeker: A Task-Oriented Benchmark For Long-Video Moment Retrieval0
Oreo: A Plug-in Context Reconstructor to Enhance Retrieval-Augmented Generation0
Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing0
SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering?0
Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge0
Language Models are Few-Shot Graders0
RAPID: Retrieval Augmented Training of Differentially Private Diffusion ModelsCode0
REAL-MM-RAG: A Real-World Multi-Modal Retrieval Benchmark0
SmartLLM: Smart Contract Auditing using Custom Generative AI0
Show:102550
← PrevPage 72 of 212Next →

No leaderboard results yet.