| RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation | Mar 21, 2025 | Code GenerationNavigate | CodeCode Available | 0 |
| Dialogue Benchmark Generation from Knowledge Graphs with Cost-Effective Retrieval-Augmented LLMs | Jan 17, 2025 | Dialogue GenerationKnowledge Graphs | CodeCode Available | 0 |
| IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios | Sep 24, 2024 | Information RetrievalRAG | CodeCode Available | 0 |
| Investigating the performance of Retrieval-Augmented Generation and fine-tuning for the development of AI-driven knowledge-based systems | Mar 12, 2024 | Domain AdaptationHallucination | CodeCode Available | 0 |
| Developing a Pragmatic Benchmark for Assessing Korean Legal Language Understanding in Large Language Models | Oct 11, 2024 | Legal ReasoningRAG | CodeCode Available | 0 |
| Detecting Manipulated Contents Using Knowledge-Grounded Inference | Apr 29, 2025 | Claim VerificationFact Checking | CodeCode Available | 0 |
| Interpersonal Memory Matters: A New Task for Proactive Dialogue Utilizing Conversational History | Mar 7, 2025 | RAGRetrieval | CodeCode Available | 0 |
| A Methodology for Evaluating RAG Systems: A Case Study On Configuration Dependency Validation | Oct 11, 2024 | HallucinationRAG | CodeCode Available | 0 |
| Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA | Jun 11, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| RACCOON: A Retrieval-Augmented Generation Approach for Location Coordinate Capture from News Articles | Jan 20, 2025 | ArticlesManagement | CodeCode Available | 0 |
| RAC: Efficient LLM Factuality Correction with Retrieval Augmentation | Oct 21, 2024 | RAGRetrieval | CodeCode Available | 0 |
| IntellBot: Retrieval Augmented LLM Chatbot for Cyber Threat Knowledge Delivery | Nov 8, 2024 | ChatbotLarge Language Model | CodeCode Available | 0 |
| RAD-Bench: Evaluating Large Language Models Capabilities in Retrieval Augmented Dialogues | Sep 19, 2024 | RAGRetrieval | CodeCode Available | 0 |
| RadioRAG: Factual large language models for enhanced diagnostics in radiology using online retrieval augmented generation | Jul 22, 2024 | DiagnosticQuestion Answering | CodeCode Available | 0 |
| Satyrn: A Platform for Analytics Augmented Generation | Jun 17, 2024 | RAGRetrieval | CodeCode Available | 0 |
| QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs | Jun 20, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 0 |
| Information Retrieval in the Age of Generative AI: The RGB Model | Apr 29, 2025 | Information RetrievalRAG | CodeCode Available | 0 |
| SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation | Oct 17, 2024 | GSM8KLanguage Modeling | CodeCode Available | 0 |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Apr 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use | May 4, 2025 | Knowledge GraphsLegal Reasoning | CodeCode Available | 0 |
| TRAQ: Trustworthy Retrieval Augmented Question Answering via Conformal Prediction | Jul 7, 2023 | Bayesian OptimizationChatbot | CodeCode Available | 0 |
| Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries | Feb 23, 2025 | BenchmarkingImage Retrieval | CodeCode Available | 0 |
| Agentic Search Engine for Real-Time IoT Data | Mar 15, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 |
| Attention Sorting Combats Recency Bias In Long Context Language Models | Sep 28, 2023 | PositionRetrieval | CodeCode Available | 0 |
| QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling | Sep 21, 2024 | Multiple-choicePrompt Engineering | CodeCode Available | 0 |