| LibreLog: Accurate and Efficient Unsupervised Log Parsing Using Open-Source Large Language Models | Aug 2, 2024 | In-Context LearningLog Parsing | CodeCode Available | 1 | 5 |
| Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End System | Apr 12, 2025 | Information RetrievalRAG | CodeCode Available | 1 | 5 |
| Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning | May 20, 2025 | Answer GenerationRAG | CodeCode Available | 1 | 5 |
| Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model | Oct 13, 2023 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 | 5 |
| Personalized Graph-Based Retrieval for Large Language Models | Jan 4, 2025 | Knowledge GraphsRetrieval | CodeCode Available | 1 | 5 |
| MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity | Dec 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking | Feb 28, 2025 | RAGRetrieval | CodeCode Available | 1 | 5 |
| CompAct: Compressing Retrieved Documents Actively for Question Answering | Jul 12, 2024 | Multi-hop Question AnsweringQuestion Answering | CodeCode Available | 1 | 5 |
| Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs | May 21, 2025 | Knowledge DistillationKnowledge Graphs | CodeCode Available | 1 | 5 |
| GPIoT: Tailoring Small Language Models for IoT Program Synthesis and Development | Mar 2, 2025 | Code GenerationProgram Synthesis | CodeCode Available | 1 | 5 |
| Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models | Mar 20, 2025 | counterfactualRAG | CodeCode Available | 1 | 5 |
| Plancraft: an evaluation dataset for planning with LLM agents | Dec 30, 2024 | Decision MakingMinecraft | CodeCode Available | 1 | 5 |
| PAKTON: A Multi-Agent Framework for Question Answering in Long Legal Agreements | May 31, 2025 | Privacy PreservingQuestion Answering | CodeCode Available | 1 | 5 |
| Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems | Mar 19, 2025 | Question AnsweringRAG | CodeCode Available | 1 | 5 |
| ORAN-Bench-13K: An Open Source Benchmark for Assessing LLMs in Open Radio Access Networks | Jul 8, 2024 | Anomaly DetectionCode Generation | CodeCode Available | 1 | 5 |
| Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL | Apr 19, 2024 | RAGRetrieval | CodeCode Available | 1 | 5 |
| CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering | Sep 29, 2024 | Graph Question AnsweringQuestion Answering | CodeCode Available | 1 | 5 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| CONFLARE: CONFormal LArge language model REtrieval | Apr 4, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| Metacognitive Retrieval-Augmented Large Language Models | Feb 18, 2024 | Response GenerationRetrieval | CodeCode Available | 1 | 5 |
| PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization | Dec 19, 2024 | InformativenessRAG | CodeCode Available | 1 | 5 |
| ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation | Mar 27, 2025 | Question AnsweringRAG | CodeCode Available | 1 | 5 |
| SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative Refinement | Jun 16, 2025 | document understandingQuestion Answering | CodeCode Available | 1 | 5 |
| ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator | May 28, 2024 | Information RetrievalLanguage Modelling | CodeCode Available | 0 | 5 |
| A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory Texts | Feb 24, 2025 | Answer GenerationInformation Retrieval | CodeCode Available | 0 | 5 |
| A Tale of Trust and Accuracy: Base vs. Instruct LLMs in RAG Systems | Jun 21, 2024 | RAGRetrieval | CodeCode Available | 0 | 5 |
| A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment | Apr 16, 2025 | Information RetrievalRAG | CodeCode Available | 0 | 5 |
| A System for Comprehensive Assessment of RAG Frameworks | Apr 10, 2025 | RAGRetrieval | CodeCode Available | 0 | 5 |
| A Comparison of Methods for Evaluating Generative IR | Apr 5, 2024 | Information RetrievalLanguage Modelling | CodeCode Available | 0 | 5 |
| On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems | Feb 20, 2025 | Long Form Question AnsweringQuestion Answering | CodeCode Available | 0 | 5 |
| Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets | Mar 12, 2025 | Answer GenerationConversational Search | CodeCode Available | 0 | 5 |
| NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question Answering | Feb 15, 2025 | ChunkingInformation Retrieval | CodeCode Available | 0 | 5 |
| NeoQA: Evidence-based Question Answering with Generated News Events | May 9, 2025 | ArticlesQuestion Answering | CodeCode Available | 0 | 5 |
| QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling | Sep 21, 2024 | Multiple-choicePrompt Engineering | CodeCode Available | 0 | 5 |
| MuseRAG: Idea Originality Scoring At Scale | May 22, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 | 5 |
| Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance | Jan 21, 2025 | Data AugmentationLanguage Modeling | CodeCode Available | 0 | 5 |
| Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation | Oct 29, 2024 | AllRetrieval | CodeCode Available | 0 | 5 |
| A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia | Dec 4, 2023 | counterfactualLanguage Modeling | CodeCode Available | 0 | 5 |
| Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation | Jun 1, 2024 | ChunkingRAG | CodeCode Available | 0 | 5 |
| Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework | Sep 24, 2024 | Benchmarkingcounterfactual | CodeCode Available | 0 | 5 |
| Mitigating Bias in RAG: Controlling the Embedder | Feb 24, 2025 | FairnessRAG | CodeCode Available | 0 | 5 |
| Consistent Autoformalization for Constructing Mathematical Libraries | Oct 5, 2024 | DenoisingRAG | CodeCode Available | 0 | 5 |
| ConQRet: Benchmarking Fine-Grained Evaluation of Retrieval Augmented Argumentation with LLM Judges | Dec 6, 2024 | BenchmarkingRetrieval | CodeCode Available | 0 | 5 |
| MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge | Dec 22, 2024 | Multi-hop Question AnsweringQuestion Answering | CodeCode Available | 0 | 5 |
| Unipa-GPT: Large Language Models for university-oriented QA in Italian | Jul 19, 2024 | ChatbotInformation Retrieval | CodeCode Available | 0 | 5 |
| Concurrent Brainstorming & Hypothesis Satisfying: An Iterative Framework for Enhanced Retrieval-Augmented Generation (R2CBR3H-SR) | Jan 3, 2024 | Decision MakingInformation Retrieval | CodeCode Available | 0 | 5 |
| Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning | Jun 5, 2025 | Question AnsweringRAG | CodeCode Available | 0 | 5 |
| Memorization and Knowledge Injection in Gated LLMs | Apr 30, 2025 | Continual LearningMemorization | CodeCode Available | 0 | 5 |
| Memory and Knowledge Augmented Language Models for Inferring Salience in Long-Form Stories | Sep 8, 2021 | FormLanguage Modeling | CodeCode Available | 0 | 5 |
| MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented Generation | Feb 24, 2025 | RAGRetrieval | CodeCode Available | 0 | 5 |