| Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities | Jun 17, 2024 | Question AnsweringRAG | CodeCode Available | 0 |
| Know3-RAG: A Knowledge-aware RAG Framework with Adaptive Retrieval, Generation, and Filtering | May 19, 2025 | Knowledge GraphsRAG | CodeCode Available | 0 |
| PROPHET: An Inferable Future Forecasting Benchmark with Causal Intervened Likelihood Estimation | Apr 2, 2025 | ArticlesCausal Inference | CodeCode Available | 0 |
| DRAFT-ing Architectural Design Decisions using LLMs | Apr 11, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 |
| DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning | Mar 21, 2024 | MemorizationRetrieval | CodeCode Available | 0 |
| Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations | Feb 21, 2025 | ArticlesFraud Detection | CodeCode Available | 0 |
| An AI-Driven Live Systematic Reviews in the Brain-Heart Interconnectome: Minimizing Research Waste and Advancing Evidence Synthesis | Jan 25, 2025 | Decision MakingRAG | CodeCode Available | 0 |
| KBAlign: Efficient Self Adaptation on Specific Knowledge Bases | Nov 22, 2024 | Question AnsweringRAG | CodeCode Available | 0 |
| DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented Generation | May 15, 2025 | graph constructionHallucination | CodeCode Available | 0 |
| Do "New Snow Tablets" Contain Snow? Large Language Models Over-Rely on Names to Identify Ingredients of Chinese Drugs | Apr 3, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 |
| JointRank: Rank Large Set with Single Pass | Jun 27, 2025 | Information RetrievalReranking | CodeCode Available | 0 |
| JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability | Feb 27, 2024 | GPUInformation Retrieval | CodeCode Available | 0 |
| Automated Bias Assessment in AI-Generated Educational Content Using CEAT Framework | May 19, 2025 | FairnessRetrieval-augmented Generation | CodeCode Available | 0 |
| AIC CTU system at AVeriTeC: Re-framing automated fact-checking as a simple RAG task | Oct 15, 2024 | Data AugmentationFact Checking | CodeCode Available | 0 |
| Quebec Automobile Insurance Question-Answering With Retrieval-Augmented Generation | Oct 12, 2024 | Question AnsweringRAG | CodeCode Available | 0 |
| Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems | Sep 29, 2024 | FairnessOpen-Domain Question Answering | CodeCode Available | 0 |
| A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory Texts | Feb 24, 2025 | Answer GenerationInformation Retrieval | CodeCode Available | 0 |
| THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models | Sep 17, 2024 | BenchmarkingBinary Classification | CodeCode Available | 0 |
| Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks | Apr 17, 2025 | Epistemic ReasoningLarge Language Model | CodeCode Available | 0 |
| Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings | Mar 19, 2025 | Instruction FollowingLarge Language Model | CodeCode Available | 0 |
| A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment | Apr 16, 2025 | Information RetrievalRAG | CodeCode Available | 0 |
| Can Github issues be solved with Tree Of Thoughts? | May 20, 2024 | Code GenerationGitHub issue resolution | CodeCode Available | 0 |
| A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia | Dec 4, 2023 | counterfactualLanguage Modeling | CodeCode Available | 0 |
| Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents | Nov 23, 2024 | Question AnsweringRAG | CodeCode Available | 0 |
| DIRAS: Efficient LLM Annotation of Document Relevance in Retrieval Augmented Generation | Jun 20, 2024 | Information RetrievalRAG | CodeCode Available | 0 |
| RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation | Mar 21, 2025 | Code GenerationNavigate | CodeCode Available | 0 |
| Dialogue Benchmark Generation from Knowledge Graphs with Cost-Effective Retrieval-Augmented LLMs | Jan 17, 2025 | Dialogue GenerationKnowledge Graphs | CodeCode Available | 0 |
| IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios | Sep 24, 2024 | Information RetrievalRAG | CodeCode Available | 0 |
| Investigating the performance of Retrieval-Augmented Generation and fine-tuning for the development of AI-driven knowledge-based systems | Mar 12, 2024 | Domain AdaptationHallucination | CodeCode Available | 0 |
| Developing a Pragmatic Benchmark for Assessing Korean Legal Language Understanding in Large Language Models | Oct 11, 2024 | Legal ReasoningRAG | CodeCode Available | 0 |
| Detecting Manipulated Contents Using Knowledge-Grounded Inference | Apr 29, 2025 | Claim VerificationFact Checking | CodeCode Available | 0 |
| Interpersonal Memory Matters: A New Task for Proactive Dialogue Utilizing Conversational History | Mar 7, 2025 | RAGRetrieval | CodeCode Available | 0 |
| A Methodology for Evaluating RAG Systems: A Case Study On Configuration Dependency Validation | Oct 11, 2024 | HallucinationRAG | CodeCode Available | 0 |
| Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA | Jun 11, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| RACCOON: A Retrieval-Augmented Generation Approach for Location Coordinate Capture from News Articles | Jan 20, 2025 | ArticlesManagement | CodeCode Available | 0 |
| RAC: Efficient LLM Factuality Correction with Retrieval Augmentation | Oct 21, 2024 | RAGRetrieval | CodeCode Available | 0 |
| IntellBot: Retrieval Augmented LLM Chatbot for Cyber Threat Knowledge Delivery | Nov 8, 2024 | ChatbotLarge Language Model | CodeCode Available | 0 |
| RAD-Bench: Evaluating Large Language Models Capabilities in Retrieval Augmented Dialogues | Sep 19, 2024 | RAGRetrieval | CodeCode Available | 0 |
| RadioRAG: Factual large language models for enhanced diagnostics in radiology using online retrieval augmented generation | Jul 22, 2024 | DiagnosticQuestion Answering | CodeCode Available | 0 |
| Satyrn: A Platform for Analytics Augmented Generation | Jun 17, 2024 | RAGRetrieval | CodeCode Available | 0 |
| QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs | Jun 20, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 0 |
| Information Retrieval in the Age of Generative AI: The RGB Model | Apr 29, 2025 | Information RetrievalRAG | CodeCode Available | 0 |
| SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation | Oct 17, 2024 | GSM8KLanguage Modeling | CodeCode Available | 0 |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Apr 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use | May 4, 2025 | Knowledge GraphsLegal Reasoning | CodeCode Available | 0 |
| TRAQ: Trustworthy Retrieval Augmented Question Answering via Conformal Prediction | Jul 7, 2023 | Bayesian OptimizationChatbot | CodeCode Available | 0 |
| Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries | Feb 23, 2025 | BenchmarkingImage Retrieval | CodeCode Available | 0 |
| Agentic Search Engine for Real-Time IoT Data | Mar 15, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 |
| Attention Sorting Combats Recency Bias In Long Context Language Models | Sep 28, 2023 | PositionRetrieval | CodeCode Available | 0 |
| QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling | Sep 21, 2024 | Multiple-choicePrompt Engineering | CodeCode Available | 0 |