SOTAVerified

Retrieval-augmented Generation

Papers

Showing 201-225 of 2196 papers

| Title | Status | Hype |
| --- | --- | --- |
| BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law | | 0 |
| InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation | | 0 |
| Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs | Code | 1 |
| Do RAG Systems Suffer From Positional Bias? | | 0 |
| Silent Leaks: Implicit Knowledge Extraction Attack on RAG Systems through Benign Queries | Code | 1 |
| Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions | | 0 |
| Adaptive Plan-Execute Framework for Smart Contract Security Auditing | | 0 |
| Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization | | 0 |
| HDLxGraph: Bridging Large Language Models and HDL Repositories via HDL Graph Databases | Code | 0 |
| Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval | | 0 |
| Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization | | 0 |
| SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation | | 0 |
| Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | | 0 |
| s3: You Don't Need That Much Data to Train a Search Agent via RL | Code | 4 |
| Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks | | 0 |
| Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds | Code | 0 |
| RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding | Code | 0 |
| Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning | Code | 1 |
| Beyond Text: Unveiling Privacy Vulnerabilities in Multi-modal Retrieval-Augmented Generation | | 0 |
| Divide by Question, Conquer by Agent: SPLIT-RAG with Question-Driven Graph Partitioning | | 0 |
| Know Or Not: a library for evaluating out-of-knowledge base robustness | Code | 1 |
| AMAQA: A Metadata-based QA Dataset for RAG Systems | | 0 |
| A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs | Code | 0 |
| Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain | | 0 |
| Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability | Code | 1 |
Page 9 of 88

No leaderboard results yet.