| BAGELS: Benchmarking the Automated Generation and Extraction of Limitations from Scholarly Text | May 22, 2025 | Benchmarking, RAG | Unverified | 0 |
| MuseRAG: Idea Originality Scoring At Scale | May 22, 2025 | RAG, Retrieval-augmented Generation | Code Available | 0 |
| CUB: Benchmarking Context Utilisation Techniques for Language Models | May 22, 2025 | Benchmarking, Fact Checking | Unverified | 0 |
| Attributing Response to Context: A Jensen-Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation | May 22, 2025 | ARC, Attribute | Unverified | 0 |
| InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation | May 21, 2025 | Benchmarking, RAG | Unverified | 0 |
| Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization | May 21, 2025 | Open-Domain Question Answering, Question Answering | Unverified | 0 |
| After Retrieval, Before Generation: Enhancing the Trustworthiness of Large Language Models in RAG | May 21, 2025 | RAG, Retrieval-augmented Generation | Unverified | 0 |
| Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval | May 21, 2025 | RAG, Retrieval | Unverified | 0 |
| Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law | May 21, 2025 | Answer Generation, Question Answering | Unverified | 0 |