| LegalBench-RAG: A Benchmark for Retrieval-Augmented Generation in the Legal Domain | Aug 19, 2024 | RAG, Retrieval | Code Available | 2 | 5 |
| LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers | Feb 25, 2025 | Multi-hop Question Answering, Question Answering | Code Available | 2 | 5 |
| Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA | Jun 25, 2024 | Benchmarking, Long-Context Understanding | Code Available | 2 | 5 |
| Learning to Filter Context for Retrieval-Augmented Generation | Nov 14, 2023 | Extractive Question-Answering, Fact Verification | Code Available | 2 | 5 |
| RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering | Feb 26, 2024 | Form, Open-Domain Question Answering | Code Available | 2 | 5 |
| A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering | Nov 13, 2023 | Decision Making, Explanation Generation | Code Available | 1 | 5 |
| Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases | Mar 15, 2024 | RAG, Retrieval | Code Available | 1 | 5 |
| SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models | Feb 28, 2025 | Attribute, Autonomous Driving | Code Available | 1 | 5 |
| Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory | May 3, 2023 | Abstractive Text Summarization, Dialogue Generation | Code Available | 1 | 5 |
| Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | May 31, 2024 | Hallucination, Multi-Task Learning | Code Available | 1 | 5 |