SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 751800 of 2111 papers

TitleStatusHype
ARCS: Agentic Retrieval-Augmented Code Synthesis with Iterative Refinement0
CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck ModelsCode0
AKIBoards: A Structure-Following Multiagent System for Predicting Acute Kidney Injury0
Detecting Manipulated Contents Using Knowledge-Grounded InferenceCode0
Graph RAG for Legal Norms: A Hierarchical and Temporal Approach0
Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses0
Can LLMs Be Trusted for Evaluating RAG Systems? A Survey of Methods and Datasets0
Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented GenerationCode0
SMARTFinRAG: Interactive Modularized Financial RAG BenchmarkCode0
LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection0
RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models0
A model and package for German ColBERT0
CiteFix: Enhancing RAG Accuracy Through Post-Processing Citation Correction0
FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation0
Synergizing RAG and Reasoning: A Systematic Review0
The Viability of Crowdsourcing for RAG EvaluationCode0
Grounded in Context: Retrieval-Based Method for Hallucination Detection0
The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language Models0
Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges0
LLMs as Data Annotators: How Close Are We to Human Performance0
Efficient Document Retrieval with G-RetrieverCode0
POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications0
FinSage: A Multi-aspect RAG System for Financial Filings Question Answering0
ResNetVLLM-2: Addressing ResNetVLLM's Multi-Modal Hallucinations0
LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval0
CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models0
Secure Multifaceted-RAG for Enterprise: Hybrid Knowledge Retrieval with Security Filtering0
Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation0
RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines0
SCRAG: Social Computing-Based Retrieval Augmented Generation for Community Response Forecasting in Social Media Environments0
ACoRN: Noise-Robust Abstractive Compression in Retrieval-Augmented Language Models0
Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild0
InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning0
Estimating Optimal Context Length for Hybrid Retrieval-augmented Multi-document SummarizationCode0
FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents0
A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance JudgmentCode0
ARCeR: an Agentic RAG for the Automated Definition of Cyber Ranges0
On the Feasibility of Using MultiModal LLMs to Execute AR Social Engineering Attacks0
Towards Conversational AI for Human-Machine Collaborative MLOps0
A Visual RAG Pipeline for Few-Shot Fine-Grained Product Classification0
LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation0
Timing Analysis Agent: Autonomous Multi-Corner Multi-Mode (MCMM) Timing Debugging with Timing Debug Relation Graph0
ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search0
Exploring the Role of Knowledge Graph-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs0
Towards Automated Safety Requirements Derivation Using Agent-based RAG0
ReZero: Enhancing LLM search ability by trying one-more-time0
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance0
DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify0
VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents0
GNN-ACLP: Graph Neural Networks based Analog Circuit Link Prediction0
Show:102550
← PrevPage 16 of 43Next →

No leaderboard results yet.