| BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law | May 21, 2025 | Answer GenerationQuestion Answering | —Unverified | 0 |
| InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation | May 21, 2025 | BenchmarkingRAG | —Unverified | 0 |
| Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs | May 21, 2025 | Knowledge DistillationKnowledge Graphs | CodeCode Available | 1 |
| Do RAG Systems Suffer From Positional Bias? | May 21, 2025 | RAGRetrieval | —Unverified | 0 |
| Silent Leaks: Implicit Knowledge Extraction Attack on RAG Systems through Benign Queries | May 21, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 1 |
| Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adaptive Plan-Execute Framework for Smart Contract Security Auditing | May 21, 2025 | RAGRetrieval-augmented Generation | —Unverified | 0 |
| Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization | May 21, 2025 | Open-Domain Question AnsweringQuestion Answering | —Unverified | 0 |
| HDLxGraph: Bridging Large Language Models and HDL Repositories via HDL Graph Databases | May 21, 2025 | RAGRetrieval | CodeCode Available | 0 |
| Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval | May 21, 2025 | RAGRetrieval | —Unverified | 0 |
| Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization | May 20, 2025 | HallucinationIn-Context Learning | —Unverified | 0 |
| SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation | May 20, 2025 | Document Layout Analysisobject-detection | —Unverified | 0 |
| Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | May 20, 2025 | Anomaly DetectionDescriptive | —Unverified | 0 |
| s3: You Don't Need That Much Data to Train a Search Agent via RL | May 20, 2025 | RAGReinforcement Learning (RL) | CodeCode Available | 4 |
| Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks | May 20, 2025 | Dataset GenerationQuestion Answering | —Unverified | 0 |
| Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds | May 20, 2025 | Causal Inferencecounterfactual | CodeCode Available | 0 |
| RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding | May 20, 2025 | Image CaptioningQuestion Answering | CodeCode Available | 0 |
| Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning | May 20, 2025 | Answer GenerationRAG | CodeCode Available | 1 |
| Beyond Text: Unveiling Privacy Vulnerabilities in Multi-modal Retrieval-Augmented Generation | May 20, 2025 | Privacy PreservingRAG | —Unverified | 0 |
| Divide by Question, Conquer by Agent: SPLIT-RAG with Question-Driven Graph Partitioning | May 20, 2025 | Attributegraph partitioning | —Unverified | 0 |
| Know Or Not: a library for evaluating out-of-knowledge base robustness | May 19, 2025 | HallucinationRAG | CodeCode Available | 1 |
| AMAQA: A Metadata-based QA Dataset for RAG Systems | May 19, 2025 | Question AnsweringRAG | —Unverified | 0 |
| A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs | May 19, 2025 | Machine Translationnamed-entity-recognition | CodeCode Available | 0 |
| Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain | May 19, 2025 | RAGRetrieval-augmented Generation | —Unverified | 0 |
| Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability | May 19, 2025 | RAGReinforcement Learning (RL) | CodeCode Available | 1 |