| KG-QAGen: A Knowledge-Graph-Based Framework for Systematic Question Generation and Long-Context LLM Evaluation | May 18, 2025 | Answer GenerationImplicit Relations | CodeCode Available | 0 |
| RAG-VR: Leveraging Retrieval-Augmented Generation for 3D Question Answering in VR Environments | Apr 11, 2025 | Answer GenerationQuestion Answering | CodeCode Available | 0 |
| Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG | Apr 7, 2025 | Answer GenerationRAG | —Unverified | 0 |
| Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching | Apr 3, 2025 | Answer GenerationEEG | CodeCode Available | 0 |
| MHTS: Multi-Hop Tree Structure Framework for Generating Difficulty-Controllable QA Datasets for RAG Evaluation | Mar 29, 2025 | Answer GenerationBenchmarking | —Unverified | 0 |
| A Retrieval-Augmented Knowledge Mining Method with Deep Thinking LLMs for Biomedical Research and Clinical Support | Mar 29, 2025 | Answer GenerationArticles | —Unverified | 0 |
| The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction | Mar 29, 2025 | Answer GenerationMemorization | —Unverified | 0 |
| A Survey of Large Language Model Agents for Question Answering | Mar 24, 2025 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer | Mar 19, 2025 | Answer GenerationMathematical Reasoning | CodeCode Available | 1 |
| RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning | Mar 17, 2025 | Answer GenerationMulti-hop Question Answering | —Unverified | 0 |
| Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets | Mar 12, 2025 | Answer GenerationConversational Search | CodeCode Available | 0 |
| MoEMoE: Question Guided Dense and Scalable Sparse Mixture-of-Expert for Multi-source Multi-modal Answering | Mar 8, 2025 | Answer GenerationMixture-of-Experts | —Unverified | 0 |
| Zero-Shot Complex Question-Answering on Long Scientific Documents | Mar 4, 2025 | Answer Generationdocument understanding | CodeCode Available | 0 |
| Enhancing Multi-hop Reasoning in Vision-Language Models via Self-Distillation with Multi-Prompt Ensembling | Mar 3, 2025 | Answer GenerationComputational Efficiency | —Unverified | 0 |
| EgoNormia: Benchmarking Physical Social Norm Understanding | Feb 27, 2025 | Answer GenerationBenchmarking | CodeCode Available | 1 |
| AgentRM: Enhancing Agent Generalization with Reward Modeling | Feb 25, 2025 | Answer Generation | —Unverified | 0 |
| A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory Texts | Feb 24, 2025 | Answer GenerationInformation Retrieval | CodeCode Available | 0 |
| Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines | Feb 23, 2025 | Answer GenerationLanguage Modeling | —Unverified | 0 |
| Mitigating Lost-in-Retrieval Problems in Retrieval Augmented Multi-Hop Question Answering | Feb 20, 2025 | Answer GenerationMulti-hop Question Answering | —Unverified | 0 |
| TabSD: Large Free-Form Table Question Answering with SQL-Based Table Decomposition | Feb 19, 2025 | Answer GenerationForm | —Unverified | 0 |
| TrustRAG: An Information Assistant with Retrieval Augmented Generation | Feb 19, 2025 | Answer GenerationChunking | CodeCode Available | 5 |
| PeerQA: A Scientific Question Answering Dataset from Peer Reviews | Feb 19, 2025 | answerability predictionAnswer Generation | CodeCode Available | 1 |
| QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval | Feb 12, 2025 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval | Feb 11, 2025 | Answer GenerationQuestion Answering | —Unverified | 0 |
| HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Feb 9, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 0 |