SOTAVerified

Open-Domain Question Answering

Open-domain question answering is the task of answering questions by drawing on a large open-domain corpus such as Wikipedia, rather than on a passage supplied with the question.

Papers

Showing 351–400 of 494 papers

| Title | Status | Hype |
|---|---|---|
| Retrieval Augmented Language Model Pre-Training | | 0 |
| Retrieving and Reading: A Comprehensive Survey on Open-domain Question Answering | | 0 |
| Retrieving Supporting Evidence for Generative Question Answering | | 0 |
| Retrieving Supporting Evidence for LLMs Generated Answers | | 0 |
| Evidentiality-aware Retrieval for Overcoming Abstractiveness in Open-Domain Question Answering | | 0 |
| Revisiting the Open-Domain Question Answering Pipeline | | 0 |
| RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm | | 0 |
| RoseRAG: Robust Retrieval-augmented Generation with Small-scale LLMs via Margin-aware Preference Optimization | | 0 |
| Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One? | | 0 |
| Tackling Query-Focused Summarization as A Knowledge-Intensive Task: A Pilot Study | | 0 |
| Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions | | 0 |
| Semantic Framework for Comparison Structures in Natural Language | | 0 |
| Semantic Parsing for Single-Relation Question Answering | | 0 |
| Semantic Sentence Composition Reasoning for Multi-Hop Question Answering | | 0 |
| Semantic Technologies in IBM Watson | | 0 |
| Silver Retriever: Advancing Neural Passage Retrieval for Polish Question Answering | | 0 |
| Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval | | 0 |
| Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization | | 0 |
| Somm: Into the Model | | 0 |
| SPAGHETTI: Open-Domain Question Answering from Heterogeneous Data Sources with Retrieval and Semantic Parsing | | 0 |
| Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization | | 0 |
| Strong and Efficient Baselines for Open Domain Conversational Question Answering | | 0 |
| Talk to Papers: Bringing Neural Question Answering to Academic Search | | 0 |
| Targeted Extraction of Temporal Facts from Textual Resources for Improved Temporal Question Answering over Knowledge Bases | | 0 |
| Text Embeddings for Retrieval From a Large Knowledge Base | | 0 |
| The Dangers of trusting Stochastic Parrots: Faithfulness and Trust in Open-domain Conversational Question Answering | | 0 |
| The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design | | 0 |
| To Adapt or to Annotate: Challenges and Interventions for Domain Adaptation in Open-Domain Question Answering | | 0 |
| Towards an open-domain conversational system fully based on natural language processing | | 0 |
| Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization | | 0 |
| Towards Human-Level Understanding of Complex Process Engineering Schematics: A Pedagogical, Introspective Multi-Agent Framework for Open-Domain Question Answering | | 0 |
| Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models | | 0 |
| Towards Universal Dense Retrieval for Open-domain Question Answering | | 0 |
| Tradeoffs in Sentence Selection Techniques for Open-Domain Question Answering | | 0 |
| Training a Ranking Function for Open-Domain Question Answering | | 0 |
| UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering | | 0 |
| UnitedQA: A Hybrid Approach for Open Domain Question Answering | | 0 |
| Unsupervised Open-Domain Question Answering | | 0 |
| Unsupervised Open-Domain Question Answering with Higher Answerability | | 0 |
| Unveiling Knowledge Utilization Mechanisms in LLM-based Retrieval-Augmented Generation | | 0 |
| Watson Discovery Advisor: Question-answering in an industrial setting | | 0 |
| Weaver: Deep Co-Encoding of Questions and Documents for Machine Reading | | 0 |
| WebFAQ: A Multilingual Collection of Natural Q&A Datasets for Dense Retrieval | | 0 |
| What or Who is Multilingual Watson? | | 0 |
| When to Read Documents or QA History: On Unified and Selective Open-domain QA | | 0 |
| Why Does ChatGPT Fall Short in Providing Truthful Answers? | | 0 |
| WikiPassageQA: A Benchmark Collection for Research on Non-factoid Answer Passage Retrieval | | 0 |
| WikiQA: A Challenge Dataset for Open-Domain Question Answering | | 0 |
| WISDOM X, DISAANA and D-SUMM: Large-scale NLP Systems for Analyzing Textual Big Data | | 0 |
| XLMRQA: Open-Domain Question Answering on Vietnamese Wikipedia-based Textual Knowledge Source | | 0 |

Benchmark Results

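| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | somebody | KILT-RL | 2.62 | | Unverified |
| 2 | Wikipedia | KILT-RL | 2.46 | | Unverified |
| 3 | arxiv.org/abs/2103.06332 | KILT-RL | 2.36 | | Unverified |
| 4 | BART + DPR | KILT-RL | 1.9 | | Unverified |
| 5 | RAG | KILT-RL | 1.69 | | Unverified |
| 6 | Training Set Retrieval (top 1) | KILT-RL | 0 | | Unverified |
| 7 | T5-base | KILT-RL | 0 | | Unverified |
| 8 | Input Copying | KILT-RL | 0 | | Unverified |
| 9 | Sphere | KILT-RL | 0 | | Unverified |
| 10 | Random Training Set Answer | KILT-RL | 0 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Re2G | KILT-EM | 43.56 | | Unverified |
| 2 | intersect | KILT-EM | 38.78 | | Unverified |
| 3 | KGI_0 | KILT-EM | 36.36 | | Unverified |
| 4 | Wikipedia | KILT-EM | 35.32 | | Unverified |
| 5 | RAG | KILT-EM | 32.69 | | Unverified |
| 6 | BERT + DPR | KILT-EM | 31.99 | | Unverified |
| 7 | BART + DPR | KILT-EM | 30.06 | | Unverified |
| 8 | Multitask DPR + BART | KILT-EM | 29.09 | | Unverified |
| 9 | Multi-task DPR | KILT-EM | 0 | | Unverified |
| 10 | Sphere | KILT-EM | 0 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Re2G | KILT-EM | 57.91 | | Unverified |
| 2 | intersect | KILT-EM | 50.56 | | Unverified |
| 3 | Wikipedia | KILT-EM | 45.55 | | Unverified |
| 4 | KGI_0 | KILT-EM | 42.85 | | Unverified |
| 5 | Multitask DPR + BART | KILT-EM | 42.36 | | Unverified |
| 6 | RAG | KILT-EM | 38.13 | | Unverified |
| 7 | BERT + DPR | KILT-EM | 34.48 | | Unverified |
| 8 | BART + DPR | KILT-EM | 31.4 | | Unverified |
| 9 | TABi | KILT-EM | 0 | | Unverified |
| 10 | T5-base | KILT-EM | 0 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | intersect | KILT-EM | 18.06 | | Unverified |
| 2 | Wikipedia | KILT-EM | 11.71 | | Unverified |
| 3 | Multitask DPR + BART | KILT-EM | 9.53 | | Unverified |
| 4 | RAG | KILT-EM | 3.21 | | Unverified |
| 5 | BART + DPR | KILT-EM | 1.96 | | Unverified |
| 6 | BERT + DPR | KILT-EM | 0.74 | | Unverified |
| 7 | Sphere | KILT-EM | 0 | | Unverified |
| 8 | Multi-task DPR | KILT-EM | 0 | | Unverified |
| 9 | GENRE | KILT-EM | 0 | | Unverified |
| 10 | chriskuei | KILT-EM | 0 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SpanBERT | F1 | 84.8 | | Unverified |
| 2 | Cluster-Former (#C=512) | EM | 68 | | Unverified |
| 3 | Locality-Sensitive Hashing | EM | 66 | | Unverified |
| 4 | Multi-passage BERT | EM | 65.1 | | Unverified |
| 5 | Sparse Attention | EM | 64.7 | | Unverified |
| 6 | DECAPROP | EM | 62.2 | | Unverified |
| 7 | Bi-Attention + DCU-LSTM | N-gram F1 | 59.5 | | Unverified |
| 8 | Denoising QA | EM | 58.8 | | Unverified |
| 9 | DecaProp | EM | 56.8 | | Unverified |
| 10 | AMANDA | N-gram F1 | 56.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fourier Transformer | Rouge-L | 26.9 | | Unverified |
| 2 | QG | Rouge-L | 26.4 | | Unverified |
| 3 | BART | Rouge-L | 24.3 | | Unverified |
| 4 | E-MCA | Rouge-L | 24 | | Unverified |
| 5 | Transformer Multitask + LayerDrop | Rouge-L | 23.4 | | Unverified |
| 6 | Multi-Inrerleave | Rouge-L | 14.63 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Evidence Aggregation via R^3 Re-Ranking | EM (Quasar-T) | 42.3 | | Unverified |
| 2 | Denoising QA | EM (Quasar-T) | 42.2 | | Unverified |
| 3 | DecaProp | EM (Quasar-T) | 38.6 | | Unverified |
| 4 | R^3 | EM (Quasar-T) | 35.3 | | Unverified |
| 5 | GA | EM (Quasar-T) | 26.4 | | Unverified |
| 6 | BiDAF | EM (Quasar-T) | 25.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FiE | Exact Match | 58.4 | | Unverified |
| 2 | R2-D2 HN-DPR | Exact Match | 55.9 | | Unverified |
| 3 | UniK-QA | Exact Match | 54.9 | | Unverified |
| 4 | UnitedQA (Hybrid) | Exact Match | 54.7 | | Unverified |
| 5 | BPR (linear scan; l=1000) | Exact Match | 41.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SPARTA | EM | 59.3 | | Unverified |
| 2 | Blended RAG | EM | 57.63 | | Unverified |
| 3 | BERTserini | EM | 50.2 | | Unverified |
| 4 | BERTserini | EM | 38.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UniK-QA | Exact Match | 57.7 | | Unverified |
| 2 | FiE+PAQ | Exact Match | 56.3 | | Unverified |
| 3 | FiE | Exact Match | 52.4 | | Unverified |
| 4 | EMDR2 | Exact Match | 48.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DrQA | EM | 70 | | Unverified |
| 2 | DCN | EM | 66.2 | | Unverified |
| 3 | MPCM | EM | 65.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ERNIE 2.0 Large | EM | 64.2 | | Unverified |
| 2 | ERNIE 2.0 Base | EM | 61.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UniK-QA | Exact Match | 65.5 | | Unverified |
| 2 | BPR (linear scan; l=1000) | Exact Match | 56.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | EMDR2 | Exact Match | 52.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UnitedQA (Hybrid) | Exact Match | 70.5 | | Unverified |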