| Towards Generative Abstract Reasoning: Completing Raven's Progressive Matrix via Rule Abstraction and Selection | Jan 18, 2024 | Answer Generation, Attribute | Code Available | 0 | 5 |
| Is ChatGPT a Biomedical Expert? -- Exploring the Zero-Shot Performance of Current GPT Models in Biomedical Tasks | Jun 28, 2023 | Answer Generation, Retrieval | Code Available | 0 | 5 |
| HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Feb 9, 2025 | Answer Generation, Language Modeling | Code Available | 0 | 5 |
| Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets | Mar 12, 2025 | Answer Generation, Conversational Search | Code Available | 0 | 5 |
| RAG-VR: Leveraging Retrieval-Augmented Generation for 3D Question Answering in VR Environments | Apr 11, 2025 | Answer Generation, Question Answering | Code Available | 0 | 5 |
| GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models | Oct 12, 2023 | Answer Generation, Hallucination | Code Available | 0 | 5 |
| Towards Personalized Answer Generation in E-Commerce via Multi-Perspective Preference Modeling | Dec 27, 2021 | Answer Generation, Question Answering | Code Available | 0 | 5 |
| MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning | Apr 27, 2024 | Answer Generation, Medical Question Answering | Code Available | 0 | 5 |
| Multilingual State Space Models for Structured Question Answering in Indic Languages | Feb 1, 2025 | Answer Generation, Diversity | Code Available | 0 | 5 |
| Genetic Approach to Mitigate Hallucination in Generative IR | Aug 25, 2024 | Answer Generation, Hallucination | Code Available | 0 | 5 |
| Answering Naturally: Factoid to Full length Answer Generation | Nov 1, 2019 | Answer Generation, Question Answering | Code Available | 0 | 5 |
| ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures | Jun 14, 2024 | Answer Generation, Benchmarking | Code Available | 0 | 5 |
| ELQA: A Corpus of Metalinguistic Questions and Answers about English | May 1, 2022 | Answer Generation, Question Answering | Code Available | 0 | 5 |
| AmazonQA: A Review-Based Question Answering Task | Aug 12, 2019 | Answer Generation, Information Retrieval | Code Available | 0 | 5 |
| Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching | Apr 3, 2025 | Answer Generation, EEG | Code Available | 0 | 5 |
| FairytaleQA Translated: Enabling Educational Question and Answer Generation in Less-Resourced Languages | Jun 6, 2024 | Answer Generation, Question Answering | Code Available | 0 | 5 |
| Pre-Trained Neural Language Models for Automatic Mobile App User Feedback Answer Generation | Feb 4, 2022 | Answer Generation, Response Generation | Unverified | 0 | 0 |
| Product Answer Generation from Heterogeneous Sources: A New Benchmark and Best Practices | Jan 16, 2022 | Answer Generation, Data Augmentation | Unverified | 0 | 0 |
| Product Answer Generation from Heterogeneous Sources: A New Benchmark and Best Practices | May 1, 2022 | Answer Generation, Data Augmentation | Unverified | 0 | 0 |
| QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval | Jul 29, 2024 | Answer Generation, Event Extraction | Unverified | 0 | 0 |
| QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval | Feb 12, 2025 | Answer Generation, Information Retrieval | Unverified | 0 | 0 |
| QontSum: On Contrasting Salient Content for Query-focused Summarization | Jul 14, 2023 | Answer Generation, Contrastive Learning | Unverified | 0 | 0 |
| Question-to-Question Retrieval for Hallucination-Free Knowledge Access: An Approach for Wikipedia and Wikidata Question Answering | Jan 20, 2025 | Answer Generation, Computational Efficiency | Unverified | 0 | 0 |
| RAG-based Question Answering over Heterogeneous Data and Text | Dec 10, 2024 | Answer Generation, Knowledge Graphs | Unverified | 0 | 0 |
| RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning | Mar 17, 2025 | Answer Generation, Multi-hop Question Answering | Unverified | 0 | 0 |
| RAGtifier: Evaluating RAG Generation Approaches of State-of-the-Art RAG Systems for the SIGIR LiveRAG Competition | Jun 17, 2025 | Answer Generation, RAG | Unverified | 0 | 0 |
| RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs | Jul 2, 2024 | Answer Generation, Question Answering | Unverified | 0 | 0 |
| Read before Generate! Faithful Long Form Question Answering with Machine Reading | Mar 1, 2022 | Answer Generation, Form | Unverified | 0 | 0 |
| ReasonChainQA: Text-based Complex Question Answering with Explainable Evidence Chains | Oct 17, 2022 | Answer Generation, Diversity | Unverified | 0 | 0 |
| Reliable Text-to-SQL with Adaptive Abstention | Jan 18, 2025 | Answer Generation, Text to SQL | Unverified | 0 | 0 |
| ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval | Feb 11, 2025 | Answer Generation, Question Answering | Unverified | 0 | 0 |
| Retrieval Augmented Generation-Based Incident Resolution Recommendation System for IT Support | Sep 6, 2024 | Answer Generation, Language Modeling | Unverified | 0 | 0 |
| Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines | Feb 23, 2025 | Answer Generation, Language Modeling | Unverified | 0 | 0 |
| Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning | May 31, 2024 | Answer Generation, Multimodal Reasoning | Unverified | 0 | 0 |
| Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy | May 29, 2023 | Answer Generation, Articles | Unverified | 0 | 0 |
| Sequencing Matters: A Generate-Retrieve-Generate Model for Building Conversational Agents | Nov 16, 2023 | Answer Generation, Retrieval | Unverified | 0 | 0 |
| S-Net: From Answer Extraction to Answer Generation for Machine Reading Comprehension | Jun 15, 2017 | Answer Generation, Machine Reading Comprehension | Unverified | 0 | 0 |
| Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers | Jul 8, 2020 | Answer Generation, Graph Representation Learning | Unverified | 0 | 0 |
| SUMBot: Summarizing Context in Open-Domain Dialogue Systems | Oct 12, 2022 | Answer Generation, Dialogue Generation | Unverified | 0 | 0 |
| TabSD: Large Free-Form Table Question Answering with SQL-Based Table Decomposition | Feb 19, 2025 | Answer Generation, Form | Unverified | 0 | 0 |
| Tackling Biomedical Text Summarization: OAQA at BioASQ 5B | Aug 1, 2017 | Answer Generation, Clustering | Unverified | 0 | 0 |
| The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction | Mar 29, 2025 | Answer Generation, Memorization | Unverified | 0 | 0 |
| The Silent Saboteur: Imperceptible Adversarial Attacks against Black-Box Retrieval-Augmented Generation Systems | May 24, 2025 | Answer Generation, Question Answering | Unverified | 0 | 0 |
| 1-800-SHARED-TASKS at RegNLP: Lexical Reranking of Semantic Retrieval (LeSeR) for Regulatory Question Answering | Dec 8, 2024 | Answer Generation, Domain Adaptation | Unverified | 0 | 0 |
| CorpusLM: Towards a Unified Language Model on Corpus for Knowledge-Intensive Tasks | Feb 2, 2024 | Answer Generation, Hallucination | Unverified | 0 | 0 |
| Towards effective teaching assistants: From intent-based chatbots to LLM-powered teaching assistants | Aug 20, 2024 | Answer Generation, Chatbot | Unverified | 0 | 0 |
| Towards Mitigating Hallucination in Large Language Models via Self-Reflection | Oct 10, 2023 | Answer Generation, Hallucination | Unverified | 0 | 0 |
| Towards Retrieval Augmented Generation over Large Video Libraries | Jun 21, 2024 | Answer Generation, Question Answering | Unverified | 0 | 0 |
| Towards Solving Multimodal Comprehension | Apr 20, 2021 | 16k, Answer Generation | Unverified | 0 | 0 |
| Towards Tractable Mathematical Reasoning: Challenges, Strategies, and Opportunities for Solving Math Word Problems | Oct 29, 2021 | Answer Generation, Math | Unverified | 0 | 0 |