SOTAVerified

Open-Domain Question Answering

Open-domain question answering is the task of answering questions from a large, unrestricted corpus such as Wikipedia, without being given the passage that contains the answer.

Papers

Showing 401–450 of 494 papers

| Title | Status | Hype |
|---|---|---|
| xMoCo: Cross Momentum Contrastive Learning for Open-Domain Question Answering | | 0 |
| You Only Need One Model for Open-domain Question Answering | | 0 |
| Zero-shot cross-lingual open domain question answering | | 0 |
| Down and Across: Introducing Crossword-Solving as a New NLP Benchmark | Code | 0 |
| A Question-Focused Multi-Factor Attention Network for Question Answering | Code | 0 |
| What’s in a Name? Answer Equivalence For Open-Domain Question Answering | Code | 0 |
| SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine | Code | 0 |
| Scaling Reasoning can Improve Factuality in Large Language Models | Code | 0 |
| Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems | Code | 0 |
| Detrimental Contexts in Open-Domain Question Answering | Code | 0 |
| When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively | Code | 0 |
| TRAQ: Trustworthy Retrieval Augmented Question Answering via Conformal Prediction | Code | 0 |
| Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base | Code | 0 |
| Answer Complex Questions: Path Ranker Is All You Need | Code | 0 |
| Bidirectional Attention Flow for Machine Comprehension | Code | 0 |
| Open-Domain Question-Answering for COVID-19 and Other Emergent Domains | Code | 0 |
| Open-Domain Event Detection using Distant Supervision | Code | 0 |
| Open Domain Question Answering over Tables via Dense Retrieval | Code | 0 |
| On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks | Code | 0 |
| Semi-Parametric Retrieval via Binary Bag-of-Tokens Index | Code | 0 |
| Open Domain Question Answering Using Early Fusion of Knowledge Bases and Text | Code | 0 |
| Sequence to sequence pretraining for a less-resourced Slovenian language | Code | 0 |
| NIR-Prompt: A Multi-task Generalized Neural Information Retrieval Training Framework | Code | 0 |
| Detecting Temporal Ambiguity in Questions | Code | 0 |
| Analysing the Robustness of Dual Encoders for Dense Retrieval Against Misspellings | Code | 0 |
| Neural Natural Logic Inference for Interpretable Question Answering | Code | 0 |
| Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval | Code | 0 |
| DESIRE-ME: Domain-Enhanced Supervised Information REtrieval using Mixture-of-Experts | Code | 0 |
| Neural Arabic Question Answering | Code | 0 |
| MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through Multi-Answer Open-Domain Question Answering | Code | 0 |
| Dense Passage Retrieval in Conversational Search | Code | 0 |
| Benchmarking LLM-based Relevance Judgment Methods | Code | 0 |
| What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge | Code | 0 |
| Combining Lexical and Dense Retrieval for Computationally Efficient Multi-hop Question Answering | Code | 0 |
| Passage-specific Prompt Tuning for Passage Reranking in Question Answering with Large Language Models | Code | 0 |
| Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering | Code | 0 |
| Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision | Code | 0 |
| Multi-step Retriever-Reader Interaction for Scalable Open-domain Question Answering | Code | 0 |
| MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQL | Code | 0 |
| SpanBERT: Improving Pre-training by Representing and Predicting Spans | Code | 0 |
| Multi-Hop Paragraph Retrieval for Open-Domain Question Answering | Code | 0 |
| Sparse, Dense, and Attentional Representations for Text Retrieval | Code | 0 |
| Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca | Code | 0 |
| Denoising Table-Text Retrieval for Open-Domain Question Answering | Code | 0 |
| Denoising Distantly Supervised Open-Domain Question Answering | Code | 0 |
| SPLADE: Sparse Lexical and Expansion Model for First Stage Ranking | Code | 0 |
| Mention Memory: incorporating textual knowledge into Transformers through entity mention attention | Code | 0 |
| R^3: Reinforced Reader-Ranker for Open-Domain Question Answering | Code | 0 |
| Defending Against Disinformation Attacks in Open-Domain Question Answering | Code | 0 |
| RA-MTR: A Retrieval Augmented Multi-Task Reader based Approach for Inspirational Quote Extraction from Long Documents | Code | 0 |

Benchmark Results

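| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | somebody | KILT-RL | 2.62 | | Unverified |
| 2 | Wikipedia | KILT-RL | 2.46 | | Unverified |
| 3 | arxiv.org/abs/2103.06332 | KILT-RL | 2.36 | | Unverified |
| 4 | BART + DPR | KILT-RL | 1.9 | | Unverified |
| 5 | RAG | KILT-RL | 1.69 | | Unverified |
| 6 | Training Set Retrieval (top 1) | KILT-RL | 0 | | Unverified |
| 7 | T5-base | KILT-RL | 0 | | Unverified |
| 8 | Input Copying | KILT-RL | 0 | | Unverified |
| 9 | Sphere | KILT-RL | 0 | | Unverified |
| 10 | Random Training Set Answer | KILT-RL | 0 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Re2G | KILT-EM | 43.56 | | Unverified |
| 2 | intersect | KILT-EM | 38.78 | | Unverified |
| 3 | KGI_0 | KILT-EM | 36.36 | | Unverified |
| 4 | Wikipedia | KILT-EM | 35.32 | | Unverified |
| 5 | RAG | KILT-EM | 32.69 | | Unverified |
| 6 | BERT + DPR | KILT-EM | 31.99 | | Unverified |
| 7 | BART + DPR | KILT-EM | 30.06 | | Unverified |
| 8 | Multitask DPR + BART | KILT-EM | 29.09 | | Unverified |
| 9 | Multi-task DPR | KILT-EM | 0 | | Unverified |
| 10 | Sphere | KILT-EM | 0 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Re2G | KILT-EM | 57.91 | | Unverified |
| 2 | intersect | KILT-EM | 50.56 | | Unverified |
| 3 | Wikipedia | KILT-EM | 45.55 | | Unverified |
| 4 | KGI_0 | KILT-EM | 42.85 | | Unverified |
| 5 | Multitask DPR + BART | KILT-EM | 42.36 | | Unverified |
| 6 | RAG | KILT-EM | 38.13 | | Unverified |
| 7 | BERT + DPR | KILT-EM | 34.48 | | Unverified |
| 8 | BART + DPR | KILT-EM | 31.4 | | Unverified |
| 9 | TABi | KILT-EM | 0 | | Unverified |
| 10 | T5-base | KILT-EM | 0 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | intersect | KILT-EM | 18.06 | | Unverified |
| 2 | Wikipedia | KILT-EM | 11.71 | | Unverified |
| 3 | Multitask DPR + BART | KILT-EM | 9.53 | | Unverified |
| 4 | RAG | KILT-EM | 3.21 | | Unverified |
| 5 | BART + DPR | KILT-EM | 1.96 | | Unverified |
| 6 | BERT + DPR | KILT-EM | 0.74 | | Unverified |
| 7 | Sphere | KILT-EM | 0 | | Unverified |
| 8 | Multi-task DPR | KILT-EM | 0 | | Unverified |
| 9 | GENRE | KILT-EM | 0 | | Unverified |
| 10 | chriskuei | KILT-EM | 0 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SpanBERT | F1 | 84.8 | | Unverified |
| 2 | Cluster-Former (#C=512) | EM | 68 | | Unverified |
| 3 | Locality-Sensitive Hashing | EM | 66 | | Unverified |
| 4 | Multi-passage BERT | EM | 65.1 | | Unverified |
| 5 | Sparse Attention | EM | 64.7 | | Unverified |
| 6 | DECAPROP | EM | 62.2 | | Unverified |
| 7 | Bi-Attention + DCU-LSTM | N-gram F1 | 59.5 | | Unverified |
| 8 | Denoising QA | EM | 58.8 | | Unverified |
| 9 | DecaProp | EM | 56.8 | | Unverified |
| 10 | AMANDA | N-gram F1 | 56.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fourier Transformer | Rouge-L | 26.9 | | Unverified |
| 2 | QG | Rouge-L | 26.4 | | Unverified |
| 3 | BART | Rouge-L | 24.3 | | Unverified |
| 4 | E-MCA | Rouge-L | 24 | | Unverified |
| 5 | Transformer Multitask + LayerDrop | Rouge-L | 23.4 | | Unverified |
| 6 | Multi-Inrerleave | Rouge-L | 14.63 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Evidence Aggregation via R^3 Re-Ranking | EM (Quasar-T) | 42.3 | | Unverified |
| 2 | Denoising QA | EM (Quasar-T) | 42.2 | | Unverified |
| 3 | DecaProp | EM (Quasar-T) | 38.6 | | Unverified |
| 4 | R^3 | EM (Quasar-T) | 35.3 | | Unverified |
| 5 | GA | EM (Quasar-T) | 26.4 | | Unverified |
| 6 | BiDAF | EM (Quasar-T) | 25.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FiE | Exact Match | 58.4 | | Unverified |
| 2 | R2-D2 HN-DPR | Exact Match | 55.9 | | Unverified |
| 3 | UniK-QA | Exact Match | 54.9 | | Unverified |
| 4 | UnitedQA (Hybrid) | Exact Match | 54.7 | | Unverified |
| 5 | BPR (linear scan; l=1000) | Exact Match | 41.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SPARTA | EM | 59.3 | | Unverified |
| 2 | Blended RAG | EM | 57.63 | | Unverified |
| 3 | BERTserini | EM | 50.2 | | Unverified |
| 4 | BERTserini | EM | 38.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UniK-QA | Exact Match | 57.7 | | Unverified |
| 2 | FiE+PAQ | Exact Match | 56.3 | | Unverified |
| 3 | FiE | Exact Match | 52.4 | | Unverified |
| 4 | EMDR2 | Exact Match | 48.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DrQA | EM | 70 | | Unverified |
| 2 | DCN | EM | 66.2 | | Unverified |
| 3 | MPCM | EM | 65.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ERNIE 2.0 Large | EM | 64.2 | | Unverified |
| 2 | ERNIE 2.0 Base | EM | 61.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UniK-QA | Exact Match | 65.5 | | Unverified |
| 2 | BPR (linear scan; l=1000) | Exact Match | 56.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | EMDR2 | Exact Match | 52.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UnitedQA (Hybrid) | Exact Match | 70.5 | | Unverified |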