| Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets | Mar 12, 2025 | Answer Generation, Conversational Search | Code Available | 0 |
| REALM: RAG-Driven Enhancement of Multimodal Electronic Health Records Analysis via Large Language Models | Feb 10, 2024 | Language Modelling, Large Language Model | Code Available | 0 |
| Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework | Sep 24, 2024 | Benchmarking, counterfactual | Code Available | 0 |
| Multi-Source Knowledge Pruning for Retrieval-Augmented Generation: A Benchmark and Empirical Study | Sep 3, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| Wikipedia in the Era of LLMs: Evolution and Risks | Mar 4, 2025 | Articles, Machine Translation | Code Available | 0 |
| REANIMATOR: Reanimate Retrieval Test Collections with Extracted and Synthetic Resources | Apr 10, 2025 | Information Retrieval, Retrieval | Code Available | 0 |
| Adaptive Meta-Learning for Robust Deepfake Detection: A Multi-Agent Framework to Data Drift and Model Generalization | Nov 12, 2024 | Adversarial Robustness, DeepFake Detection | Code Available | 0 |
| FS-RAG: A Frame Semantics Based Approach for Improved Factual Accuracy in Large Language Models | Jun 23, 2024 | RAG, Retrieval | Code Available | 0 |
| From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive Grammars | Nov 23, 2024 | Descriptive, In-Context Learning | Code Available | 0 |
| DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models | Jul 24, 2024 | Retrieval-augmented Generation, World Knowledge | Code Available | 0 |
| Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice | Jan 2, 2025 | RAG, Retrieval-augmented Generation | Code Available | 0 |
| From Interests to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries | Dec 26, 2024 | Fairness, Language Modeling | Code Available | 0 |
| ForPKG: A Framework for Constructing Forestry Policy Knowledge Graph and Application Analysis | Nov 17, 2024 | graph construction, Knowledge Graphs | Code Available | 0 |
| Beyond Scores: A Modular RAG-Based System for Automatic Short Answer Scoring with Feedback | Sep 30, 2024 | Few-Shot Learning, Prompt Engineering | Code Available | 0 |
| Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges | Jun 12, 2025 | Decision Making, RAG | Code Available | 0 |
| Beyond Benchmarks: Evaluating Embedding Model Similarity for Retrieval Augmented Generation Systems | Jul 11, 2024 | Information Retrieval, Model Selection | Code Available | 0 |
| Toward Optimal Search and Retrieval for RAG | Nov 11, 2024 | Question Answering, RAG | Code Available | 0 |
| Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented Generation | Apr 28, 2025 | Chunking, RAG | Code Available | 0 |
| Are Large Language Models Good at Utility Judgments? | Mar 28, 2024 | Answer Generation, Benchmarking | Code Available | 0 |
| A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models | Jan 2, 2024 | Financial Analysis, Hallucination | Code Available | 0 |
| Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models | Sep 30, 2024 | Few-Shot Learning, In-Context Learning | Code Available | 0 |