SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a technique that combines the strengths of retrieval-based and generation-based models. A retrieval system first selects relevant documents or passages from a large corpus; a generation model, typically a neural language model, then conditions on the retrieved text to produce a response. Grounding the output in retrieved evidence can improve the accuracy and coherence of generated text, especially in tasks that require detailed knowledge or long-context handling.
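The retrieve-then-generate flow described above can be sketched in a few lines. This is a minimal illustration, not a reference implementation: the bag-of-words cosine retriever stands in for a real sparse or dense retriever, and `retrieve`, `build_prompt`, `rag_answer`, and the `generate` callable are hypothetical names introduced here. In practice `generate` would wrap a call to a language model.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and split into alphanumeric tokens.
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    # Cosine similarity between two bag-of-words Counters.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, corpus, k=2):
    # Toy retriever (assumption): rank passages by word-overlap cosine similarity.
    q = Counter(tokenize(query))
    ranked = sorted(
        corpus,
        key=lambda doc: cosine(q, Counter(tokenize(doc))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, passages):
    # Place the retrieved passages in front of the question.
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer using only the context.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


def rag_answer(query, corpus, generate, k=2):
    # `generate` is any callable mapping a prompt string to a response string.
    passages = retrieve(query, corpus, k)
    return generate(build_prompt(query, passages))
```

Passing `generate=lambda prompt: prompt` returns the assembled prompt itself, which is a convenient way to inspect what the generator would see before wiring in a real model.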

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

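The performance of RAG systems is usually measured on two levels. Retrieval quality is typically reported with precision and recall over the retrieved passages, while answer quality is typically reported with exact match, F1 score, and BLEU score against reference answers. Popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.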
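As an illustration of two of the answer-level metrics named above, the following sketch computes exact match and token-level F1 between a predicted and a reference answer. It assumes SQuAD-style answer normalization (lowercasing, then stripping punctuation and English articles); other benchmarks may normalize differently, and the function names are chosen here for illustration.

```python
import re
import string
from collections import Counter


def normalize(text):
    # Lowercase, drop punctuation, drop English articles, collapse whitespace.
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    # 1.0 if the normalized strings are identical, else 0.0.
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction, reference):
    # Harmonic mean of token-level precision and recall.
    pred = normalize(prediction).split()
    ref = normalize(reference).split()
    if not pred or not ref:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred == ref)
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)
```

For example, the prediction "Paris, France" against the reference "Paris" is not an exact match, but it has token precision 1/2 and recall 1, giving an F1 of 2/3. When a question has several reference answers, a common convention is to take the maximum score over the references.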

Papers

Showing 401-450 of 2111 papers

| Title | Status | Hype |
|---|---|---|
| Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track | Code | 1 |
| UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis | Code | 1 |
| Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation | Code | 1 |
| Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata | Code | 1 |
| R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation | Code | 1 |
| Unified Active Retrieval for Retrieval Augmented Generation | Code | 1 |
| TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation | Code | 1 |
| R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models | Code | 1 |
| We Have a Package for You! A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs | Code | 1 |
| DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Code | 1 |
| RATT: A Thought Structure for Coherent and Correct LLM Reasoning | Code | 1 |
| Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | Code | 1 |
| One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models | Code | 1 |
| Toward Conversational Agents with Context and Time Sensitive Long-term Memory | Code | 1 |
| Video Enriched Retrieval Augmented Generation Using Aligned Video Captions | Code | 1 |
| ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text | Code | 1 |
| SynthAI: A Multi Agent Generative AI Framework for Automated Modular HLS Design Generation | Code | 1 |
| Certifiably Robust RAG against Retrieval Corruption | Code | 1 |
| G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models | Code | 1 |
| The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG) | Code | 1 |
| ERAGent: Enhancing Retrieval-Augmented Language Models with Improved Accuracy, Efficiency, and Personalization | Code | 1 |
| Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents | Code | 1 |
| LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation | Code | 1 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Code | 1 |
| Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL | Code | 1 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Code | 1 |
| Spiral of Silence: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering | Code | 1 |
| ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence | Code | 1 |
| Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation | Code | 1 |
| RAR-b: Reasoning as Retrieval Benchmark | Code | 1 |
| CONFLARE: CONFormal LArge language model REtrieval | Code | 1 |
| CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering | Code | 1 |
| CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems | Code | 1 |
| Generation of Asset Administration Shell with Large Language Model Agents: Toward Semantic Interoperability in Digital Twins in the Context of Industry 4.0 | Code | 1 |
| LexDrafter: Terminology Drafting for Legislative Documents using Retrieval Augmented Generation | Code | 1 |
| JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning | Code | 1 |
| S3LLM: Large-Scale Scientific Software Understanding with LLMs using Source, Metadata, and Document | Code | 1 |
| Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases | Code | 1 |
| Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health records | Code | 1 |
| Federated Recommendation via Hybrid Retrieval Augmented Generation | Code | 1 |
| Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks | Code | 1 |
| RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval | Code | 1 |
| WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario | Code | 1 |
| REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering | Code | 1 |
| Evaluating Very Long-Term Conversational Memory of LLM Agents | Code | 1 |
| Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems | Code | 1 |
| What Evidence Do Language Models Find Convincing? | Code | 1 |
| A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation | Code | 1 |
| C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models | Code | 1 |
| How well do LLMs cite relevant medical references? An evaluation framework and analyses | Code | 1 |
Page 9 of 43

No leaderboard results yet.