SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 126150 of 2111 papers

TitleStatusHype
DocReRank: Single-Page Hard Negative Query Generation for Training Multi-Modal RAG Rerankers0
Contextual Memory Intelligence -- A Foundational Paradigm for Human-AI Collaboration and Reflective Generative AI Systems0
RAG-Zeval: Towards Robust and Interpretable Evaluation on RAG Responses through End-to-End Rule-Guided Reasoning0
VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement LearningCode3
SkewRoute: Training-Free LLM Routing for Knowledge Graph Retrieval-Augmented Generation via Score Skewness of Retrieved Context0
RAGPPI: RAG Benchmark for Protein-Protein Interactions in Drug DiscoveryCode0
Cross-modal RAG: Sub-dimensional Retrieval-Augmented Text-to-Image GenerationCode0
Climate Finance BenchCode0
Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems0
DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation0
Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference0
Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration0
Rethinking Chunk Size For Long-Document Retrieval: A Multi-Dataset AnalysisCode0
What LLMs Miss in Recommendations: Bridging the Gap with Retrieval-Augmented Collaborative Signals0
Diagnosing and Resolving Cloud Platform Instability with Multi-modal RAG LLMs0
Public Discourse Sandbox: Facilitating Human and AI Digital Communication Research0
AI-Supported Platform for System Monitoring and Decision-Making in Nuclear Waste Management with Large Language Models0
Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework0
LLM Web Dynamics: Tracing Model Collapse in a Network of LLMs0
Project Riley: Multimodal Multi-Agent LLM Collaboration with Emotional Reasoning and Voting0
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement LearningCode1
Benchmarking Multimodal Knowledge Conflict for Large Multimodal ModelsCode1
syftr: Pareto-Optimal Generative AICode3
Anveshana: A New Benchmark Dataset for Cross-Lingual Information Retrieval On English Queries and Sanskrit Documents0
NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question AnsweringCode1
Show:102550
← PrevPage 6 of 85Next →

No leaderboard results yet.