| CyberMetric: A Benchmark Dataset based on Retrieval-Augmented Generation for Evaluating LLMs in Cybersecurity Knowledge | Feb 12, 2024 | General KnowledgeMultiple-choice | CodeCode Available | 2 | 5 |
| Retrieval-Augmented Perception: High-Resolution Image Perception Meets Visual RAG | Mar 3, 2025 | RAGRetrieval | CodeCode Available | 2 | 5 |
| Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework | Jun 20, 2024 | HallucinationQuestion Answering | CodeCode Available | 2 | 5 |
| MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search | Mar 26, 2025 | Decision MakingRAG | CodeCode Available | 2 | 5 |
| Datrics Text2SQL. A Framework for Natural Language to SQL Query Generation | Mar 15, 2025 | Natural Language QueriesRAG | CodeCode Available | 2 | 5 |
| ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents | Feb 21, 2024 | Active LearningPosition | CodeCode Available | 2 | 5 |
| AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval | Apr 9, 2024 | AllInformation Retrieval | CodeCode Available | 2 | 5 |
| ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems | Nov 16, 2023 | RAGRetrieval | CodeCode Available | 2 | 5 |
| LLM-based SPARQL Query Generation from Natural Language over Federated Knowledge Graphs | Oct 8, 2024 | Knowledge GraphsRAG | CodeCode Available | 2 | 5 |
| KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model | Jan 2, 2025 | MTEB BenchmarkRetrieval-augmented Generation | CodeCode Available | 2 | 5 |
| Benchmarking Large Language Models in Retrieval-Augmented Generation | Sep 4, 2023 | Benchmarkingcounterfactual | CodeCode Available | 2 | 5 |
| LLM4Ranking: An Easy-to-use Framework of Utilizing Large Language Models for Document Reranking | Apr 10, 2025 | RerankingRetrieval-augmented Generation | CodeCode Available | 2 | 5 |
| EVOR: Evolving Retrieval for Code Generation | Feb 19, 2024 | Code GenerationRAG | CodeCode Available | 2 | 5 |
| Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models | Jan 27, 2024 | Medical Question AnsweringMultiple-choice | CodeCode Available | 2 | 5 |
| LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering | Oct 23, 2024 | ChunkingQuestion Answering | CodeCode Available | 2 | 5 |
| Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Apr 15, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 | 5 |
| LitLLM: A Toolkit for Scientific Literature Review | Feb 2, 2024 | RAGRetrieval | CodeCode Available | 2 | 5 |
| DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning | Oct 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation | Jan 30, 2024 | HallucinationKnowledge Distillation | CodeCode Available | 2 | 5 |
| LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search | Oct 24, 2024 | ClusteringGPU | CodeCode Available | 2 | 5 |
| LegalBench-RAG: A Benchmark for Retrieval-Augmented Generation in the Legal Domain | Aug 19, 2024 | RAGRetrieval | CodeCode Available | 2 | 5 |
| Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA | Jun 25, 2024 | BenchmarkingLong-Context Understanding | CodeCode Available | 2 | 5 |
| CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation | Oct 30, 2024 | BenchmarkingPassage Retrieval | CodeCode Available | 2 | 5 |
| DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models | Mar 15, 2024 | RAGRetrieval | CodeCode Available | 2 | 5 |
| EfficientRAG: Efficient Retriever for Multi-Hop Question Answering | Aug 8, 2024 | Multi-hop Question AnsweringQuestion Answering | CodeCode Available | 2 | 5 |