SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is an approach that combines the strengths of retrieval-based and generation-based models. A retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, conditions on the retrieved information to generate a response. This can improve the accuracy and coherence of generated text, especially in tasks that require detailed knowledge or long-context handling.
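The retrieve-then-generate flow described above can be sketched in a few lines of Python. This is a toy illustration, not a reference implementation: the bag-of-words cosine retriever, the prompt template, and the names `retrieve`, `build_prompt`, and `rag_answer` are all assumptions made for this example, and `generate` stands in for whatever language model call a real system would use.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, corpus, k=2):
    """Return the k passages most similar to the query (toy retriever)."""
    q = Counter(tokenize(query))
    ranked = sorted(
        corpus,
        key=lambda passage: cosine(q, Counter(tokenize(passage))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, passages):
    """Place the retrieved passages in front of the question (hypothetical template)."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


def rag_answer(query, corpus, generate, k=2):
    """Retrieve passages, build a prompt, and pass it to a caller-supplied generator."""
    passages = retrieve(query, corpus, k)
    return generate(build_prompt(query, passages))
```

In practice the retriever would more likely be a sparse index or a dense embedding model, and `generate` would wrap a language model; the control flow stays the same.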

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.
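As a rough illustration of two of the metrics named above, the following sketch computes exact match and token-level F1 for a single predicted answer against a single reference. The normalization steps (lowercasing, stripping punctuation and English articles) follow a convention common in extractive question answering, but they are an assumption here; individual benchmarks define their own rules, and the function names are hypothetical.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction, reference):
    """Harmonic mean of token-level precision and recall."""
    pred = normalize(prediction).split()
    ref = normalize(reference).split()
    if not pred or not ref:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred == ref)
    # Counter intersection keeps the minimum count of each shared token.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)
```

Scores over a dataset are typically the mean of these per-example values, taking the maximum over references when an example has several acceptable answers.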

Papers

Showing 1851-1900 of 2111 papers

Title | Status | Hype
Tabular Embedding Model (TEM): Finetuning Embedding Models For Tabular RAG Applications | - | 0
Generative AI for Low-Carbon Artificial Intelligence of Things with Large Language Models | - | 0
Tool Calling: Enhancing Medication Consultation via Retrieval-Augmented Large Language Models | - | 0
Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering | - | 0
Human-Imperceptible Retrieval Poisoning Attacks in LLM-Powered Applications | - | 0
From Local to Global: A Graph RAG Approach to Query-Focused Summarization | Code | 14
Prompt Leakage effect and defense strategies for multi-turn LLM interactions | - | 0
Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents | Code | 1
Telco-RAG: Navigating the Challenges of Retrieval-Augmented Language Models for Telecommunications | Code | 2
IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents | - | 0
Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations | Code | 0
LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation | Code | 1
Retrieval-Augmented Audio Deepfake Detection | - | 0
Evaluating Retrieval Quality in Retrieval-Augmented Generation | Code | 1
Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL | Code | 1
Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation | - | 0
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation | - | 0
RAGAR, Your Falsehood Radar: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models | - | 0
LongEmbed: Extending Embedding Models for Long Context Retrieval | Code | 2
RAM: Towards an Ever-Improving Memory System by Learning from Communications | - | 0
iRAG: Advancing RAG for Videos with an Incremental Approach | - | 0
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Code | 1
Enhancing Q&A with Domain-Specific Fine-Tuning and Iterative Reasoning: A Comparative Study | - | 0
A Survey on Retrieval-Augmented Text Generation for Large Language Models | - | 0
Position Engineering: Boosting Large Language Models through Positional Information Manipulation | - | 0
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence | Code | 1
Spiral of Silence: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering | Code | 1
Generative AI Agents with Large Language Model for Satellite Networks via a Mixture of Experts Transmission | - | 0
Cross-Data Knowledge Graph Construction for LLM-enabled Educational Question-Answering System: A Case Study at HCMUT | - | 0
Generative AI Agent for Next-Generation MIMO Design: Fundamentals, Challenges, and Vision | - | 0
Reducing hallucination in structured outputs via Retrieval-Augmented Generation | - | 0
Generative Information Retrieval Evaluation | - | 0
LLMs in Biomedicine: A study on clinical Named Entity Recognition | Code | 0
Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation | Code | 1
Towards Robustness of Text-to-Visualization Translation against Lexical and Phrasal Variability | - | 0
Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation | Code | 2
Onco-Retriever: Generative Classifier for Retrieval of EHR Records in Oncology | - | 0
AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval | Code | 2
RAR-b: Reasoning as Retrieval Benchmark | Code | 1
Dimensionality Reduction in Sentence Transformer Vector Databases with Fast Fourier Transform | - | 0
MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering | - | 0
Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models | - | 0
IITK at SemEval-2024 Task 2: Exploring the Capabilities of LLMs for Safe Biomedical Natural Language Inference for Clinical Trials | Code | 0
A Comparison of Methods for Evaluating Generative IR | Code | 0
CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering | Code | 1
CONFLARE: CONFormal LArge language model REtrieval | Code | 1
uTeBC-NLP at SemEval-2024 Task 9: Can LLMs be Lateral Thinkers? | Code | 0
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization | Code | 4
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems | Code | 1
Octopus v2: On-device language model for super agent | - | 0
Page 38 of 43

No leaderboard results yet.