SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 501550 of 2111 papers

TitleStatusHype
Conversational Gold: Evaluating Personalized Conversational Search System using Gold NuggetsCode0
Pandora's Box or Aladdin's Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language ModelsCode0
Out of Style: RAG's Fragility to Linguistic VariationCode0
Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based ReasoningCode0
Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting FrameworkCode0
Pairing Analogy-Augmented Generation with Procedural Memory for Procedural Q&ACode0
A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory TextsCode0
On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation SystemsCode0
Continual Stereo Matching of Continuous Driving Scenes With Growing ArchitectureCode0
A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance JudgmentCode0
NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question AnsweringCode0
Optimizing and Evaluating Enterprise Retrieval-Augmented Generation (RAG): A Content Design PerspectiveCode0
Multimodal Integrated Knowledge Transfer to Large Language Models through Preference Optimization with Biomedical ApplicationsCode0
Consistent Autoformalization for Constructing Mathematical LibrariesCode0
Unipa-GPT: Large Language Models for university-oriented QA in ItalianCode0
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM HallucinationsCode0
MuseRAG: Idea Originality Scoring At ScaleCode0
Mitigating Bias in RAG: Controlling the EmbedderCode0
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented GenerationCode0
ACL Ready: RAG Based Assistant for the ACL ChecklistCode0
CommunityKG-RAG: Leveraging Community Structures in Knowledge Graphs for Advanced Retrieval-Augmented Generation in Fact-CheckingCode0
MindScope: Exploring cognitive biases in large language models through Multi-Agent SystemsCode0
Communication- and Computation-Efficient Distributed Submodular Optimization in Robot Mesh NetworksCode0
MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAGCode0
Agentic Search Engine for Real-Time IoT DataCode0
MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented GenerationCode0
Ask, Retrieve, Summarize: A Modular Pipeline for Scientific Literature SummarizationCode0
Medical large language models are easily distractedCode0
Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-ReasoningCode0
NeoQA: Evidence-based Question Answering with Generated News EventsCode0
MaFeRw: Query Rewriting with Multi-Aspect Feedbacks for Retrieval-Augmented Large Language ModelsCode0
Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic ReasoningCode0
ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented GenerationCode0
Agentic Reasoning: Reasoning LLMs with Tools for the Deep ResearchCode0
AR-RAG: Autoregressive Retrieval Augmentation for Image GenerationCode0
ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate DisclosuresCode0
LSRP: A Leader-Subordinate Retrieval Framework for Privacy-Preserving Cloud-Device CollaborationCode0
Climate Finance BenchCode0
LTRR: Learning To Rank Retrievers for LLMsCode0
MCCoder: Streamlining Motion Control with LLM-Assisted Code Generation and Rigorous VerificationCode0
Agent-Enhanced Large Language Models for Researching Political InstitutionsCode0
Citegeist: Automated Generation of Related Work Analysis on the arXiv CorpusCode0
LLMs in Biomedicine: A study on clinical Named Entity RecognitionCode0
CiteCheck: Towards Accurate Citation Faithfulness DetectionCode0
CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-TrainingCode0
LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large ContextsCode0
Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practiceCode0
QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option ShufflingCode0
LLM Embedding-based Attribution (LEA): Quantifying Source Contributions to Generative Model's Response for Vulnerability AnalysisCode0
LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and MitigationCode0
Show:102550
← PrevPage 11 of 43Next →

No leaderboard results yet.