SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 451475 of 2111 papers

TitleStatusHype
POQD: Performance-Oriented Query Decomposer for Multi-vector retrievalCode1
PeerQA: A Scientific Question Answering Dataset from Peer ReviewsCode1
Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience ReportCode1
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge GraphsCode1
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language ModelsCode1
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point ThinkingCode1
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question CoverageCode1
Plancraft: an evaluation dataset for planning with LLM agentsCode1
Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language ModelCode1
ORAN-Bench-13K: An Open Source Benchmark for Assessing LLMs in Open Radio Access NetworksCode1
OverThink: Slowdown Attacks on Reasoning LLMsCode1
PAKTON: A Multi-Agent Framework for Question Answering in Long Legal AgreementsCode1
Deep Equilibrium Object DetectionCode1
Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation SystemsCode1
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language ModelsCode1
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question AnsweringCode1
C-RAG: Certified Generation Risks for Retrieval-Augmented Language ModelsCode1
PA-RAG: RAG Alignment via Multi-Perspective Preference OptimizationCode1
NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question AnsweringCode0
QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMsCode0
NeoQA: Evidence-based Question Answering with Generated News EventsCode0
MuseRAG: Idea Originality Scoring At ScaleCode0
Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class ImbalanceCode0
Attribute or Abstain: Large Language Models as Long Document AssistantsCode0
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM HallucinationsCode0
Show:102550
← PrevPage 19 of 85Next →

No leaderboard results yet.