SOTAVerified

Multi-hop Question Answering

Papers

Showing 125 of 202 papers

TitleStatusHype
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language ModelsCode7
Retrieval-Augmented Generation with Hierarchical KnowledgeCode4
Husky: A Unified, Open-Source Language Agent for Multi-Step ReasoningCode3
Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical PerceptionCode3
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM AgentsCode2
EfficientRAG: Efficient Retriever for Multi-Hop Question AnsweringCode2
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model ReasoningCode2
LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented SearchersCode2
MASKSEARCH: A Universal Pre-Training Framework to Enhance Agentic Search CapabilityCode2
Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base EmbeddingsCode1
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question AnsweringCode1
Can Language Models Solve Graph Problems in Natural Language?Code1
HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual DataCode1
Improving Embedded Knowledge Graph Multi-hop Question Answering by introducing Relational Chain ReasoningCode1
Analyzing the Effectiveness of the Underlying Reasoning Tasks in Multi-hop Question AnsweringCode1
HopWeaver: Synthesizing Authentic Multi-Hop Questions Across Text CorporaCode1
Grounded Multi-Hop VideoQA in Long-Form Egocentric VideosCode1
End-to-End Beam Retrieval for Multi-Hop Question AnsweringCode1
Dynamic Semantic Graph Construction and Reasoning for Explainable Multi-hop Science Question AnsweringCode1
Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question AnsweringCode1
Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning StepsCode1
Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-GenerationCode1
Answering Questions by Meta-Reasoning over Multiple Chains of ThoughtCode1
DeepEdit: Knowledge Editing as Decoding with ConstraintsCode1
Avoiding Reasoning Shortcuts: Adversarial Evaluation, Training, and Model Development for Multi-Hop QACode1
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Multi-hop Dense Passage Retriever (MDR)Answer F156.5Unverified
#ModelMetricClaimedVerifiedStatus
1Beam RetrievalAn69.2Unverified