| mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs | May 16, 2025 | Information Retrieval, Knowledge Graphs | Code Available | 1 | 5 |
| Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage | Oct 20, 2024 | Answer Generation, RAG | Code Available | 1 | 5 |
| Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs | Feb 13, 2025 | Benchmarking, Retrieval | Code Available | 1 | 5 |
| DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Jun 9, 2024 | Common Sense Reasoning, Denoising | Code Available | 1 | 5 |
| Docopilot: Improving Multimodal Models for Document-Level Understanding | Jan 1, 2025 | Document Understanding, RAG | Code Available | 1 | 5 |
| BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine | May 1, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation | Dec 17, 2024 | Question Answering, RAG | Code Available | 1 | 5 |
| ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text | May 26, 2024 | Arrhythmia Detection, RAG | Code Available | 1 | 5 |
| ECoRAG: Evidentiality-guided Compression for Long Context RAG | Jun 5, 2025 | Answer Generation, Open-Domain Question Answering | Code Available | 1 | 5 |
| One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models | May 30, 2024 | Question Answering, RAG | Code Available | 1 | 5 |